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O^l ' Abstract: The guiding-center reduction is studied using gyro- gauge-independent coor- 

5— ( ! 

Ojl dinates. The Lagrangian l-form of charged particle dynamics is Lie transformed without 

■^^ , introducing a gyro-gauge, but using directly the unit vector of the component of the velocity 

^^ I perpendicular to the magnetic field as the coordinate corresponding to Larmor gyration. 

The reduction is shown to provide a maximal reduction for the Lagrangian and to work to 

i-^ I all orders in the Larmor radius, following exactly the same procedure as when working with 

the standard gauge-dependent coordinate. 

jy^ . The gauge-dependence is removed from the coordinate system by using a constrained 

1— H ' variable for the gyro-angle. The closed l-form dO is replaced by a more general non-closed 

^ I l-form, which is equal to dO in the gauge-dependent case. The gauge vector is replaced by 

• 1—t, a more general connection in the definition of the gradient, which behaves as a covariant 

^~>. derivative, in perfect agreement with the circle-bundle picture. This explains some results of 

^ i' previous works, whose gauge-independent expressions did not correspond to a gauge fixing 

but indeed correspond to a connection fixing. 

^ , In addition, some general results are obtained for the guiding-center reduction. The ex- 

r^ ■ 

pansion is polynomial in the cotangent of the pitch-angle as an effect of the structure of the 



Lagrangian, preserved by Lie derivatives. The induction for the reduction is shown to rely 



(N 

rn 

^^ ' on the inversion of a matrix which is the same for all orders higher than three. It is inverted 

and explicit induction relations are obtained to go to arbitrary order in the perturbation 



expansion. The Hamiltonian and symplectic representations of the guiding-center reduction 
are recovered, but conditions for the symplectic representation at each order are emphasized. 
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I. INTRODUCTION 

The dynamics of a charged particle in a strong magnetic field shows up a separation of time- 
scales, with the existence of a fast Larmor rotation, slower drifts, and an adiabatic invariant. This 
can be used to isolate completely the slow part of the dynamics from the single fast coordinate, the 
gyro-angle, and to build a true constant of motion, the magnetic moment [1, 2]. Thus the guiding- 
center reduction brings a slow reduced motion and decreases by two the effective dimension of the 
dynamics. It is the starting point of gyrokinetics, which is a kinetic description of plasma dynamics 
in a strong magnetic field, and a key model in the study of plasma micro-turbulence [3]. The most 
efficient results for the guiding-center reduction are obtained by Lie transforming the Lagrangian 
1-form in an expansion in a small parameter related with the magnetic inhomogeneity at the scale 
of the Larmor radius [2, 4-6]. 

In this paper, we clarify some aspects of guiding-center theory, especially some difficulties 
about the definition of the gyro-angle, by showing that the reduction can be done at arbitrary 
order in the Larmor radius using gyro-gauge-independent coordinates. 

Indeed, the introduction of a scalar coordinate for the angle measuring the Larmor rotation, 
namely the gyro-angle, imposes to choose at each point in the configuration space a direction, an 
axis, in the plane perpendicular to the magnetic field, defined as the zero from which the angle 
is measured. This corresponds to fixing a gauge in the theory, the so-called gyro-gauge, and the 
presence of this gauge in the theory raised some non-trivial questions [7-11], even the bare existence 
of a global choice of gauge can fail. So, it is interesting to consider the reduction from a more 
intrinsic point of view. 

In a previous work [12], we proposed a guiding-center reduction which avoided to introduce 
a gyro-gauge. The idea was to Lie transform directly the equations of motion instead of Lie 
transforming the Lagrangian, as is usually done [13, 14]. This brought a much simplified reduction. 
Especially, it provided the minimal guiding-center reduction which concerned only four coordinates 
(instead of six): the transformation generator had no gyro-angle component. So, for the gyro- 
angle, the initial gauge-independent coordinate could be used, and no gauge fixing was needed. 
This physical coordinate is the unit vector c of the component of the velocity orthogonal to the 
magnetic field, which defines the direction of the perpendicular velocity. 

A limitation of that approach was that it was not suited to non-minimal guiding-center reduc- 
tions, for which the method relying on Lie transforming the Lagrangian appeared as necessary, or 
at least much more efficient. 

For instance, it is interesting to have the slow reduced motion Hamiltonian, but the Hamiltonian 
structure of the reduced model is hard to deal with when working on the equations of motion. In 
addition, the magnetic moment is usually taken as one of the reduced coordinates; this can be 



done by working on the equations of motion, but it is not so straightforward [15]. In a deeper 
way, the freedom involved in the gyro-averaged part of the coordinate change can be employed for 
the reduced motion to be as strongly reduced as possible, to make the guiding-center dynamics as 
simplified as possible. 

Obtaining such a maximal guiding-center reduction by Lie transforming the equations of motion 
is far from simple, especially because it implies to solve non-trivial secular differential equations. 
On the contrary. Lie transforming the Lagrangian 1-form basically relies on algebraic equations, 
and the requirements for a maximal reduction are not much more difficult to get than the minimal 
ones. This method also guarantees that the 4-dimensional slow reduced motion is Hamiltonian, 
by working on a quarter-canonical structure in the Poisson bracket. 

So, the goal is to use the physical gauge-independent coordinate also when Lie transforming 
the Lagrangian, in order to consider a gauge-independent maximal guiding-center reduction. 

The introduction of a vectorial quantity c for the gyro- angle coordinate raises some questions, 
because the coordinate system becomes constrained: the variable c has to remain normalized and 
perpendicular to the magnetic field. Changing the spatial position q implies to change the coordi- 
nate c at the same time. This induces a connection for a covariant derivative on a space-dependent 
circle, which is related to the circle-bundle picture underlying in the gyro-angle coordinate [11, 16]. 
It was already present in [12] when Lie transforming the equations of motion, but it is more in- 
volved to deal with for the full guiding-center reduction, because the coordinate c will be changed 
as well, and not only derivatives or vector fields are involved, but also differential forms. 

The resulting reduction will naturally provide gauge-independent results, whereas in the usual 
approach, they were obtained only for a part of the reduced quantities. This fact can shed inter- 
esting light on previous guiding-center results, especially those related to gauge invariance. For 
instance, in the usual approach, the gradient is not gauge-independent, and the reduced Pois- 
son bracket involves a gauge- independent corrected gradient. A comparison with the results of 
the gauge-independent formulation is a way to get an intrinsic interpretation for this corrected 
gradient. 

The results of the gauge-independent formulation can also be used to explore other questions 
about the gauge-dependent approach, for instance related to gauge arbitrariness and anholonomy 
[5, 7]. To avoid confusion, they will be the topic of another paper [17]. Here, we show how a 
maximal guiding-center reduction can be derived in a gauge-independent formulation to arbitrary 
order in the Larmor radius. 

The proof relies on explicit induction relations to all orders, because the induction can be 
written as a matrix product, with some coefficients being differential operators. Through inversion 
of this matrix, a maximal reduction can be studied, towards a more complete reduction and a 



more general viewpoint on special reductions considered in previous works, such as the so-called 
Hamiltonian and symplectic representations identified in [18]. 

In the derivation, the cotangent of the pitch-angle is used as the coordinate corresponding 
to the parallel velocity (component of the velocity parallel to the magnetic field), since this 
coordinate simplified computations for the minimal guiding-center reduction in [12] and made all 
formulae polynomials. This will clarify why this polynomiality can be observed in the results of 
the full guiding-center reduction as well. 

The paper is organized as follows. In sect. 2, a few facts are reminded about the initial dynamics, 
the choice of coordinates for the gyro-angle, the method of Lie transforming the Lagrangian 1-form, 
and the hierarchy of requirements involved in the guiding-center reduction. 

For the sake of completeness and clarity, the mechanism of Lie transforming the Lagrangian 
through an expansion in a small parameter is described in an appendix, with emphasis on the 
three steps it involves: an initialization for the lowest orders, whose choices are the key to make 
the reduction work and possibly be optimal; an algorithm which applies for higher orders, is purely 
mechanical and can be applied to study the reduction to arbitrary order; and an intermediate step 
in between. 

In sect. 3, the method is applied to the guiding-center reduction in case the gyro-angle co- 
ordinate is chosen as the physical variable c. The derivation is written in matrix form, which 
emphasizes both the lowest-order choices that allow the reduction to work and to be maximal for 
the Lagrangian, and the algorithmic character of the procedure at higher orders. The full deriva- 
tion is explained because we are interested in the reduction at arbitrary order in the Larmor radius, 
which implies to use all the ingredients of the detailed mechanism at work. For orders lower than 
3, the procedure follows the same lines as when working with the gauge-dependent coordinate, 
but formulae have to be used in their intrinsic version, for instance because the basis of 1-forms 
involves non-closed 1-forms. Finally, for orders higher than 2, explicit formulae are given for the 
induction relations, allowing to go to arbitrary order, and to give a unified framework where recent 
results may seem to discord somehow with each other [18, 19]. 

In sect. 4, the results are compared with previous works, either Lie transforming the equations 
of motion, or using a gauge-dependent gyro-angle. 

For the sake of simplicity, we consider the special case where there is no electric field, but the 
generalization for a non-zero electric field is straightforward, as will be shown in subsect. IV A. 

II. COORDINATES, METHOD AND REQUIREMENTS 

The dynamical system is simply a charged particle with position q, momentum p, mass m and 
charge e, under the influence of a static inhomogeneous magnetic field B. The motion is given by 



the Lorentz force 
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When the magnetic field is strong, the motion imphes a separation of time-scales. This is best 
seen by choosing convenient coordinates for the momentum space, for instance 

P-= IIpII : 

pb ' 



99 := arccos (^-it) , 



c : = 



where b := j^ is the unit vector of the magnetic field, and p_L := p — (p-b)b is the so-called 
perpendicular momentum, i.e. the orthogonal projection of the momentum onto the plane B 
perpendicular to the magnetic field. The coordinate p is the norm of the momentum; the coordinate 
if is the so-called pitch-angle, i.e. the angle between the velocity and the magnetic field. The last 
coordinate c is the unit vector of the perpendicular velocity. 
Then, the equations of motion write 

p = 0, 

Vi = -^-Vb.c, 

c = -— a - £--Vb-(cb + aa cot w) , 

mm ^ ~ ' ' 

where p is now a shorthand for p(b cos 99-1-0 sin 99), the norm of the magnetic field ||B|| is denoted by 
B, and following Littlejohn's notations [4-6], the vector a := b x c is the unit vector of the Larmor 
radius, so that (a, b, c) is a right-handed orthonormal frame (rotating with the momentum). 

In the case of a strong magnetic field, the only fast term is the Larmor frequency uji^ := ^. 
Writing the dynamics as z-c^z, all other terms as uJi, appear to be of order —V, which means 
that the small parameter of the theory is of order ;;£;— V = ^V. A more detailed study (e.g. 
[1, 2, 5, 12]) shows that it is rather 

, psinip Y7 

^ ■~ eB ^ ■ 

It is an operator, but the gradient V has only a meaning for orderings; it acts on the magnetic 
field and can be given a more precise meaning by relations such as V ~ nTv^^^lF" '^^^ ordering 
parameter e is related to the magnetic inhomogeneity at the scale of the Larmor radius tl '■= ^^f^- 
By abuse of language, it is often considered as just the Larmor radius, or as the inverse charge e^^ 

[1, 2, 5]. 



A. Choice for the gyro-angle coordinate 

The Larnior frequency cjl := ^- concerns only one coordinate, c, the direction of the vector 
p_L in the 2-dimensional plane perpendicular to the magnetic field. This corresponds to an angle, 
the so-called gyro-angle, and measures the Larmor gyration of the particle momentum around the 
magnetic field. 

To get a true scalar angle instead of the vector c, one chooses at each point q in space a direction 
which will be considered as the reference axis ei(q) G B (q). This corresponds to fixing a gauge 
in the theory, the so-called gyro-gauge. Then, the gyro-angle 9 is defined from the oriented angle 
between the chosen reference axis ei(q) and the vector c through the following relation: 

c = — sin 9ei — cos 9e2 ■ (1) 

The equation of motion for 9 is 

^ = ^ + cot w^-Vb-a + £--Vere2 , 

rn " m m J- ^ ) 

with 62 := b X 61 the unit vector such that (b, ei , 62) is a (fixed) right-handed orthonormal frame [5] . 

The coordinate 9 is not intrinsic, it depends on the chosen gauge ei(q), which raised some 
questions about the gauge- invariance of the theory, about the failure of global existence for ei , as 
well as about the presence of an anholonomic phase in the coordinate system [5, 7, 11, 20, 21]. 
All these difficulties originate because 9 is not given by the physics, it is not a purely intrinsic 
coordinate. For physical results, what is needed is not 9 but only c. This is clearly illustrated 
by all the results of guiding-center theory, e.g. [1, 2, 5, 6], where 9 intervenes only through the 
quantity c everywhere (except in its own definition and in subsequent relations). So, we will avoid 
this coordinate and keep the corresponding initial variable, c, as in [12]. The quantity 9 will 
be used only with a symbolic meaning for the fast angle, or when making comparison with the 
gauge-dependent approach. 

The use of a unit vector avoids having to fix a gauge for the zero of the angle, and it allows to 
work with a physical quantity: the unit vector of the perpendicular velocity 

c := /^ = E^Mb^pi ^2) 

which indeed corresponds to the direction of the perpendicular velocity, and is a coordinate mea- 
suring the Larmor gyration. It is an angle, since it is a unit vector in a plane, namely the plane B , 
orthogonal to the local magnetic field. But this unit vector is immersed into M^, which means it is 
in S^(q). This spatial dependence implies that the coordinate space is constrained: the gyro-angle 
c is not independent of the spatial position. 



When the coordinate q is changed, the coordinate c cannot be kept unchanged, otherwise it 
may get out of B^. Differentiating relation (2) with respect to q gives 

Vc = -Vb-(cb + aacot99). (3) 

This formula can be obtained more easily by noticing that in the change of coordinates (q, p) — > 
(q,p, (p, c), the following relations hold 

— sin (pVip = V cos 99 = Vb-— = Vb-csin99 , 



p 
=> \7(p = -Vb-c, 

= Vc sin 99 + Vb cos f + Vipl—b sin 99 + c cos (p) 
= (Vc + Vb-cb) sin ip + Vb-(1 — cc) cos 99 , 
^ Vc = -Vb-(cb + aacotv9), (4) 

in which V means differentiation with respect to q while keeping p constant, and we used that 
1 = aa + bb + cc and that b is a unit vector, which implies Vb-b = V ( ^ I =0. 

Formula (3) is not well defined where (p = (mod vr), i.e. where p is parallel to B. But this 
is no trouble, since it fits in with a usual limitation of guiding-center theories. For instance, 
guiding-center transformations are not defined where ip is zero, since they involve many sine/? as 
denominators [1, 2, 5, 6]. At the points where (p = (mod vr), the vector c itself is not defined, 
neither is the coordinate 0. It is an implicit assumption in all gyro-kinetics and guiding-center 
works that those points are excluded from the theory. 

If the coordinate 6 was used, formula (3) would be replaced by 

Vc = -Vb-cb-FVere2 a, (5) 

where Vei-e2 is the so-called gauge-vector, which is usually denoted by R, depends only of the 
position q, and is related to the choice of gauge. 

To fit in with both coordinates, we define a more general function R(q, p): 

Kg := Vc-a . 

Then, in any coordinates, the previous formulae (3) and (5) write 

Vc = -Vb-cb + Kga . (6) 

The value R^ = — Vb-aacot99 corresponds to the physical definition of c, and the corresponding 
formula (3). The value R^ = Vei-e2 = R will be linked with the usual case relying on the 
gauge-dependent coordinate 6, according to formula (5). 



The spatial dependence of the vector c through formula (6) must be taken into account each 
time a gradient acts on a function that depends on the fast angle c, e.g. in total derivatives, in Lie 
transforms, and when computing the action of a spatial component of the change of variables. 

B. Lie transforming the Lagrangian 1-form 

The goal is to isolate the slow part of the dynamics from the fast angle 9 (or c) , by performing 
a near-identity change of coordinates such that the dynamics of the remaining coordinates does 
not depend on 9; this averaging procedure is the primary requirement for the guiding-center 
reduction. We showed in [12] how it can be derived by Lie transforming the equations of motion 
when keeping the physical coordinate c. It appeared to be very straightforward for the minimal 
guiding-center reduction, but the additional requirements (e.g. the use of the magnetic moment 
as a coordinate or a further simplification of the reduced dynamics) were not so easy to obtain. 
They are more efficiently obtained by Lie transforming the Lagrangian 1-form, which relies on the 
Hamiltonian structure of this dynamical system. 

The corresponding Hamiltonian function is just the particle kinetic energy 

H ■= ^ 

The Poisson bracket is non-canonical, it contains the gyro-magnetic coupling term 

{F, G} = dqF-dpG - dpF-dqG - dpF-eB x dpG . (7) 

Together, the Poisson bracket and the Hamiltonian induce the equations of motion through 
Hamilton's equations 

± = {z,H}, (8) 

where all the coordinates were grouped in a single phase-space vector z := (p,q). 

Instead of working with the Poisson bracket and the Hamiltonian, it is easier to work on the 
Poincare-Cartan 1-form T [6]. This last is usually called just a Lagrangian 1-form, or simply 
a Lagrangian. It concentrates all the information on the Hamiltonian structure into one single 
quantity, which in addition is much less constrained than a Poisson bracket. It is a 1-form, defined 
over a 7-dimensional space y := (p, q, t) by [2, 6, 14] 

T ■.= {eA + p)-dq- Hdt, (9) 

which yields a variational formulation of the dynamics with the action [5] : 

A:-- 



The symplectic part of T is Tg := (eA + p)-(iq [2, 6]. It is a l-forin Tg = Vgdz'^ in the usual 
6-dimensional phase-space, which gives the Lagrange 2-forni through exterior derivative 

ujg ■= dVs = {diVi - djVi) dz' (g> dz^ , (10) 

where ® means tensorial product, and Einstein's convention is used: there is an imphcit summation 
over repeated indices. In turn, ujs is invertible, and its inverse gives the Poisson bracket 

J := w^S 
with JI the bivector defined by the relation {F, G} = diFT^djG. 

In the presence of a strong magnetic field, the small parameter e allows for a perturbation 
expansion of all quantities. The Lagrangian T from formula (9) can be written 

r = r_i + ro, 

where the index refers to the order in the magnetic field, (or in e~^, following Northrop's work [1]) 

r_i:=eA-dq, 
ro:=^-dq-^cit, (11) 

whose ratio is indeed of order e. The lowest order being —1 reminds that the guiding-center is a 
singular perturbation theory. 

The Lagrangian is transformed by the exponential of a Lie transform 

r — >r :=e^xr, 

where £x is the Lie derivative along the vector field X, whose inverse is the generator of the 
near-identity coordinate transformation: 

y — ^y :=e~^y. 

The generator can be expanded X = Xi + X2 + ..., where X„ is the term of X that is of order 
n in e. In fact, it is equivalent but simpler to replace the single transformation g^i"'' ^2+--- with 
a complicated generator by a series of transformations with a simple generator for each of them 

y^y:=...e-^2g-Giy^ 

where — G„ is the vector field generating the n-th transformation, and is purely of order e". The 
Lagrangian then transforms as 

r — ^r := ...e'-^e'-ir, 



10 



where L„ := £g„ is the Lie derivative along the vector field G„. Notice that £x / Li + L2 + ... 
because G^ / X„. 

As the Lagrangian is time-independent, it is interesting to use time-independent perturbation 
theory, by imposing a transformation that does not depend on time and does not afi"ect the time 
coordinate: G is constant in time and G* = 0. 

This implies that the Hamiltonian will transform as a scalar function. Indeed, for any time- 
independent vector field G and any 1-form, 

CgT' = [iG-dr]t + [d{iGr)]t 

= G-dyrt + dt{GyT = G-dyTt, 

where formula (13) was used for Lie derivatives, and formula (10) for exterior derivatives (here in 
the 7-dimensional space). 

Another consequence is that at each order n in e, there will be seven requirements (see next 
subsection), one for each component of the reduced Lagrangian r„, and only six freedoms, one for 
each component of the time independent transformation generator G„. One freedom is missing 
in G„ and must be looked for elsewhere. Now, the Lagrangian is defined only to within a total 
derivative, since only its exterior derivative has a physical meaning, and d{T + dS) = dT + d'^S = dT 
for any function S, which is called a gauge function [2]; be careful, this gauge has nothing to see with 
the gyro-gauge nor the gyro-angle, it is just an arbitrariness in the definition of the Lagrangian. 
The freedom embodied in S is needed to obtain the maximal reduction, since it gives the expected 
seventh freedom. Then the reduced Lagrangian is : 

It will be determined order by order in e": 

r_i = r_i + dS-i (12) 

To = Lir_i + To + dSo 

Ti = (l2 + ^) r_i + LiFo + dSi 

T2 = (L3 + L2L1 + ^)r_i + (l2 + ^) To + dS2 



L4 + L3Li + L2(^ + ^) + S 



r_i 



+ L3 + L2Li + ^ To + dSs 



In principle, these are differential equations for G, because the action of a Lie derivative Cg 
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over a 1-forni 7 writes 

Cg^ = {icd + dic)^ , (13) 

where the operator ic is the interior product, e.g. 

iG7 = G-7 = 7(G) . (14) 

So, the first term in (13) is algebraic in G, but the second one is differential in G. 

However, the differential operators can be avoided by the following argument. The last term 
in (13) involves an exterior derivative, and can be removed by redefining the gauge function S. In 
addition, formula (13) together with the property d^ = imply that the exterior derivative and 
the Lie derivative commute: 

Cad = dCc = d icd . 

This means that for any vector fields X and G 

CcCyj: = {icd + diG)Cx.T = iGd{iy^d + diyi)T + d{iG,CyJ-) 
= iadixdT + d^iaCxT) . 

In computations for T, the last term can again be removed by redefining the gauge function S. 
By induction, it is now easy to see that in formula (12), exponentials of Lie derivatives can be 
considered as just exponentials of interior products provided the gauge function Sn is defined in a 
convenient way at each order e" to absorb all the exterior derivatives involved in equation (12): 

CG„^CG„,^...CG„^T + dS 

= iiG^^d){iG^^d)...{zG„J)T + dS'. 

For the following, we will redefine S according to this rule, but for simplicity, we drop the prime 
and write S for S'. 

Using this rule and the fact that in the equation for r„, the rj<„ are already known, equations 
(12) can be written 

r_i = r_i + ds_i (15) 

To = Gi-uj^i + To + dSo 

Ti = G2-CJ-1 + ^-(wo + ^0) + dSi 

Gi 



Ts = Gs-uj-i + G2-LO0 + ^■d[Gr{2uJo + cJq)] + dS2 

2 ^-1 



Tg = G4-W-1 + Ga-ZJo + Gs-ZJi - ^^^ 



+ ^^rd^Gv{3ojo + uJo) + dS3 



12 



where the notation G- := ic, is used for the interior product, as in formula (14). In addition, the 
n-n-order Lagrange 2-forni was defined in the natural way: 

In the next subsection, we study the properties that are wished for T. In the next section, the 
unknowns G„ and Sn will be determined such that r„ has those desired properties. 

C. The hierarchy of requirements 

A) The primary requirement for the guiding-center reduction is to isolate the slow dynamics of 
the coordinates (q, '^) from the fast gyro-angle 6. From the point of view of the Lagrangian F, it 
may be obtained by making F independent of 9. This is actually stronger than the strict minimal 
requirement, since it implies to average the dynamics of 6 as well. However, in the Lagrangian 
approach, contrary to when working on the equations of motion, the minimal requirement would 
be difficult to get, if not impossible, and it is quite easier to average all the reduced dynamics. 

So, the goal is that the reduced Lagrangian does not depend on the reduced gyro-angle, which 
means that all its non-zero Fourier components (i.e. purely oscillatory terms) are zero: 

osc(r„)=0, (16) 

where following Littlejohn's notations, osc = 1 — avg is the projector onto gyro-fluctuations, with 
avg the complementary projector onto gyro- averages: 

2tt 



Jo 



for any function /. This average can be computed without introducing any gauge: either using 
the intrinsic calculus introduced in [15], either using the matrix calculus introduced in [12]. Also, 
the coordinate 9 can be used as an intermediate quantity for this computation, which is made 
at constant q, so that the presence of a gauge (only for the intermediate computation) is of no 
consequence. 

B) Averaging the motion or the Lagrangian does not determine the average components of the 
coordinate change, as is clear in [12], for instance. This lets some freedom in the procedure and 
suggests to impose stronger requirements for the reduction. The basic idea is to use the available 
freedoms to make the reduced dynamics as simplified as possible. 

A natural prospect is to make trivial the reduced dynamics 

i^ =0, 

for some components j, by including constants of motion in the reduced coordinates. For the 
remaining coordinates, one can consider putting their reduced dynamics to zero just for orders 



13 



higher than 2 or 3 for instance: 



^n = 0, 



for all higher orders, where the index n refers to the order in e" and the exponent j indicates the 
component of the vector z. 

When this is achieved, the reduced dynamics is given just by lowest-order terms; it is exactly 
known after the lowest orders have been derived, without computing the reduction at higher orders, 
which are useful only to determine the transformation. 

In the procedure working on the Lagrangian 1-form, the "components" are not the ones of the 
reduced equations of motion, but the ones of the reduced Lagrangian 

rl = o, (17) 

for higher n. For differential forms, we use the same convention as for vectors: the index n indicates 
the order in e" and the exponent j refers to the component. It departs from the usual notation 
r„ = {Tn)jdz^ , but it avoids excessive use of parentheses. 

Equation (17) gives additional (i.e. non minimal) requirements for the reduced Lagrangian T 
and it is used to determine the averaged transformation generators. When it can not be obtained 
completely, the goal it to obtain it for as many components j as possible, to get what can be 
considered as the maximal reduction. 

C) When studying the Lie transform of the Lagrangian 1-form, one variable, namely the mag- 
netic moment JL, plays a key role, as the variable conjugated to the gyro-angle. 

Basically, including JL among the reduced coordinates is a way to obtain a more efficient 
reduction process by making one of the components of the reduced Lagrangian trivial JI := T . 
Indeed, when the derivation is performed with the variable p, then the ^-component of the 
Lagrangian is given by a whole series, which is the magnetic moment. Changing the variable p in 
such a way that the new variable absorbs this series is a way to have the reduced ^-component 
trivial, just given by a coordinate. This also simplifies the reduction algorithm, by providing a 
simpler expression for cJ, which will play a key role in the derivation. 

Second, the resulting variables 6 and JL are conjugated, which implies that the magnetic moment 
is a constant of motion besides the norm of the momentum p. And this conserved quantity is 
preferable to the variable p because, unlike p, it remains an adiabatic invariant in the presence of 
a wide class of electric fields. 

The variable JLis a whole series in the Larmor radius, and its lowest-order term is the well-known 
adiabatic invariant /U often confounded with JL 

^ - ^ •- 2mB ■ Uo) 
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So, an interesting additional requirement is to include the magnetic moment in the reduced 
coordinates. For the equation to be solved, this requirement is expressed by 

t:=Ti. (19) 

Indeed, then the reduced Poisson bracket verifies 

F^ = 1, andF' = 0, (20) 

for i ^ 9, which means that the dynamics of JL is zero 

ji = {H,-fl} = -deH = G, 

since the Hamiltonian H does not depend on ^, as a consequence of (16). 

There is a third reason for including the magnetic moment among the reduced coordinates and 
requiring F = /i. It is concerned with the Hamiltonian structure of guiding-center dynamics. The 
6-dimensional reduced motion z is Hamiltonian, since it is just the transform of the Hamiltonian 
motion z. But the true reduced guiding-center motion is the 4-dimensional slow motion (q, '^). It 
is the truncation of the full dynamics z, but truncations of a Hamiltonian dynamics are in general 
not Hamiltonian. However, in some cases, truncations are automatically Hamiltonian, and a 
special case is the quarter canonical structure of the Poisson bracket, defined by conditions (20). 
This can be seen by imposing Dirac's constraints (/I, 6) to the reduced dynamics, or by verifying 
that the truncated bracket is actually just given by starting from the initial Lie algebra of all 
functions of the phase-space /(z), and taking the sub-algebra of functions that do not depend 
on the magnetic moment f{q,Tp,6). Thus, including Jl in the reduced coordinates is a way to 
guarantee the reduced slow motion to be Hamiltonian. 

The requirement on the magnetic moment fixes one of the freedoms involved in the average 
components of the coordinate change. The other freedoms are used to make the reduced dynamics 
as simple as possible, by putting to zero as many average components of F„ as possible for higher 
n, as indicated by formula (17). 

D) To sum it up, the guiding-center reduction involves a hierarchy of requirements: The primary 
requirement (minimal reduction, with an averaged reduced dynamics) is to remove the fast time- 
scale by averaging the Lagrangian over the gyro-angle; the corresponding equation is (16). The 
secondary requirement (intermediate reduction, with a constant of motion and a Hamiltonian 
slow reduced motion) is to include the magnetic moment among the reduced coordinates by the 
quarter-canonical structure; the corresponding equation is (19). The third optional requirement 
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(maximal reduction, with a simplified reduced dynamics) is to use the remaining freedoms to make 
the reduced Lagrangian as simplified as possible; the corresponding equation is (17). 

This makes seven requirements at each order in e, one for each component of the reduced 
Lagrangian r„ in formulae (12), and seven freedoms are needed. For a time-independent transfor- 
mation, those are G^ and Sn, as announced in the previous subsection. 

III. DERIVATION OF THE REDUCTION 

Let us now turn to the guiding-center reduction. The details of the procedure as well as the 
practical computations may seem intricate and they hide somehow that the basic ideas of the 
reduction are very elementary. It is why the principles and general lines of the procedure are 
presented in the appendix, to give a clear view of the reduction process. 

In this section, the three stages of the method presented in the appendix are shown to work 
with the coordinate c in a similar way as with the standard approach relying on a gyro-gauge. The 
transformation at lowest orders is computed for comparison with previous works, and it is shown 
how the reduction can be performed to arbitrary order in the Larmor radius by obtaining explicit 
induction relations. 

Each order of the derivation can be given several numbers. For instance, what is usually called 
the first order is the order just after the lowest order. For the Lagrangian, it corresponds to Fq 
(since the lowest order corresponds to r„i), which is rather considered here as the order 0. In 
addition, the order in the various quantities will be mixed up: for instance, in the derivation, the 
order involving F2 will be the equation for Gg'"^, as well as for G2 and also for G^ For the sake 
of clarity, we will always consider the order n as the one corresponding to F„ (or rather r„), and 
we will often use the expression "at order r„", instead of "at the order corresponding to r„". 

A. Preliminary transformation and initial setting 

The goal is to solve equation (15) for the guiding-center reduction, with the Lagrangian 
(11), and with the requirements (16)-(19) for the averaging reduction, for the magnetic moment 
reduction, and for the maximal reduction. 

First of all, the change of coordinates from the norm of the momentum p to the magnetic 
moment JI is not near identity, as is clear in equation (18). Before beginning the reduction, a 
preliminary change of coordinates must be done, so that all the remaining transformation will be 
near identity. A suitable preliminary change of coordinates is 

(q,V'>P>c) — > {q,ip,fi,c), 
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where /x is the zeroth-order magnetic moment 

_ (psiny)^ 
^ 2mB ' 

which is a weh-known adiabatic invariant, and often confounded with /I. In the new variables, the 
Lagrangian becomes 



r := [eA + ^/2jj,mB{b cot cp + c)] -dq - ^5(1 + cot^ ip)dt . 

Interestingly, the pitch-angle (p intervenes only through its cotangent, which was mentioned in 
[12] as making all quantities polynomials. Here, this feature is obvious in the Lagrangian, and it is 
preserved by derivatives, so that it will be preserved throughout all of the derivation. Actually, the 
magnetic moment makes the polynomiality still more accurate than in [12], where the variables c/? 
and p were used, and the Larmor-radius prefactor ri = psimp was not polynomial in cot f. Here, 
the magnetic moment /U absorbs the p sin ip and all formulae will be purely polynomials in cot ip 
and monomials in y^, which is useful to simplify computations. 

So, we actually choose to change coordinates according to 

(q,¥',P,c) — > (q,(/>,^,c), 
with 

(p := cot if 

the variable that makes all formulae polynomials. 

Also, the structure of the Lagrangian shows that one can make the coefficients e and m disappear 
by noticing that the magnetic field B appears only through eB, provided n is considered as 
appearing only through fim/e, and dt appears only through dt/m. 

The particle charge e is usually kept in guiding-center works because the order in e~^ indicates 
the order in e [1]. Here, it is useless since the order in e is already indicated by the order in other 
quantities: r„ and r„ are of order ^ I a/-^V ) . The reason is that all quantities will be series 



in r^V = a/ -^V, as a result of the structure of the Lagrangian, and as will be confirmed by the 
derivation, e.g. formulae (30), (41)-(43), (53)-(55), (56)-(57), etc. Hence the order can be readily 
obtained by the overall order in \AB or V. These last two quantities have the drawback of being an 
operator, or a space-dependent function; in addition, their order is only dimensional, e.g. ^ is 
of order V^i?^'^. For a readily control of the order, it is useful to have a scalar parameter, which 
was considered as e~^ in previous works. Here, this role can be played by y^, since all quantities 
will be monomial in it. Thus, keeping e to indicate the order of expansion is indeed not necessary. 
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Thus, we make the scahng 



A- 


-^ A 


■.= eA 




B - 


^B 


■.= eB 




z^- 


-^^: 


fj,m 
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(psinip)'^ 
2eB 


t- 


-^t:-- 


_ t 
m ' 





which avoids unnecessary coefficients in the derivation. It agrees with the physics, where the effect 
of the magnetic field on particle dynamics always includes the coupling constant e. For simplicity, 
we will drop the underline, e.g. we will write B for _B. The Lagrangian becomes 



r := [A + yj2iiB{h cot ip + c)] -dq - ^5(1 + cot^ ip)dt . 
The derivation starts from the Lagrangian with the expansion (11) 

r := r_i + To , 

with 

r_i = A-dq, 



ro = v^2^(b(/. + c)-(iq-A*5(l + (/>2)dt. (21) 

It can be divided into its average and fiuctuating part 

avg(r_i) = r_i = A-dq 



avg(ro) = ^/2fiBb(l)-dq^ - ^iB{l + (t?)dt 



osc(ro) = V2/lBc-dq. (22) 

The previous section showed that the process involves the Lagrange 2-form c<;„. It can be split 
in three basic terms 

bj = dV = a;_i + wq + Wo , (23) 

with 

u-i := dr_i = d{A)- A dq 



ojQ := dosc(ro) = d{y^2^Bc)- A dq 



tJo := (iavg(ro) = d{^/2fiBb(l))- A dq - d{fiB{l + (j)'^)) A dt , 
where the symbol A denotes the antisymmetry operator 

da.b. Adc = da.b.dc — dc.b.da , 
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for any matrix b and any vectors a and b. 

The first term in (23) is the lowest-order Lagrange 2-form, related to Larmor gyrations. The 
second (resp. third) term in (23) is the exterior derivative of the oscillating (resp. average) zeroth- 
order Lagrangian. Be careful, this is not the oscillating (resp. average) zeroth-order Lagrange 
2-form; for instance 

cJo := '^(osc(ro)) / osc(ti;o) = osc((iro) , 
because the exterior derivative does not preserve gyro- fluctuations. 

The contributions (23) to the Lagrange 2-form are explicitly given by 

a;_i : = dV^i = dq-V(A)- A dq = (iq-(-B) X dq (24) 



Wo 



davg(ro) = d{y/2fiBh(p)- A dq - d(^5(l + cp"^)) A dt 
= y/2^ |#b + dq-Vb + [^^^ + 0] (/>b} • A dq 
- {(1 + 02)(5d/i + lidB) + 2^5d0} A ^ 
ojQ : = dosc(ro) = d{\/2^Bc)- A dq 

= y^2jlB [dc + [^^ + 1^] c} • A dq . 

Now, dc involves two contributions: one corresponding purely to the gyro-angle, in which the 
variable c is changed at constant q, and a second one coming from a change in the coordinate q, 
which comes because the gyro-angle is a constrained coordinate, as mentioned about formulae (3) 
and (6). The second part is given by 

dq-Vc = — dq-Vb-cb + dq-R^a . 

The first one is then (dc — dq-Vc), but it is written more precisely as (dc — dq-Vc)-aa since when 
q is constant, the variation of c can only be in the direction of a, i.e. along the circle S"'^(q). On 
the whole 

dc = —dq-Vb-cb + dq-R^a + (dc — dq-Vc)-aa . 

This shows that, in the gauge-independent approach, the natural form corresponding to d9 is 

(50:=-(dc-dq-Vc)-a, 

where the minus sign comes to agree with the usual convention (1) on the orientation of the 
gyro-angle. Unlike d9, this form is not closed in general; it is why it is written 69. In the 
gauge-dependent case, it is closed and indeed corresponds to d9, as a consequence of (5). 
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As the reduction process mainly relies on inversions oi oj = dT (or rather uJ), which can be 
viewed as a matrix inversion, a matrix notation is well suited and makes the discussion clearer. 
This implies to choose a basis of 1-forms; the derivation of co above shows that a natural basis is 



c-dq, a-(iq, b-dq | d(j),dfi, 



dt 



(25) 



where a vertical dash | is put to separate the space-, the momentum- and the time-components. 
The choice of —dt makes the corresponding coordinate H instead of —H. 

Those 1-forms are not closed, unlike the standard dz'\ but it is not needed, provided one is 
careful of using intrinsic definitions for the operations involved in the procedure (see formulae (28) 
and (48) for instance). 

In this basis, formulae (24) become 
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(26) 
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where for visual purpose, vertical and horizontal lines are used to separate the position-, 
momentum- and time-components. The spatial part of the matrix ZJq (resp. Cjq) is just a vec- 
tor product vx, with the vector v := la + Jh + Kc (resp. v := la + Jb + Kc), where the 
coefficients are 



J:-- 

K :-- 

i :-- 

J :-- 

k :-- 



a-V X {y/2nB(j)b) -- 


= 0(^ - b-Vb-c) 


b-V X {^j2^B(t)h) 


= (pfa-Vb-c- c-Vb-a 


c-V X {^/2^B(t)h) -- 


= (/)fb-Vb-a-^' 


a-V X {y/2fiBc) = 


a-Vba-cVb-c 
2 


b-V X (\/2/i5c) = 


3-VB _ p> 
2B ^"-"-S 



-C-V X (V2/IBc) = a-Vb-c + b-Rg . 
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B. Immediate orders 1 and 

At lowest order n = — 1, the reduced Lagrangian (15) writes 

r_i = r„i + ds.i . 

The change of variable has no effect at this order, since it is near-identity. But the averaging 
condition is trivially verified; it is actually a condition for the near-identity Lie transform to 
remove the fast time-scale, which is possible only when at lowest order, the Lagrangian is already 
gyro-averaged. Now, the only freedom involved S-i can not be useful and is set to zero 

At the following order n = 0, the reduced Lagrangian is 

To = Gvu^-i + ro + dSo. (27) 

To write vectors in matrix form, a basis has to be chosen for vector fields also. It is most convenient 
to choose the dual of the basis (25) for 1-forms, so that the coupling Gi-a;_i is computed as a 
standard matrix product. The desired basis is easily identified as (using the natural isomorphism 
between vector fields and differential operators) 

(c-V, a-V, b-V I d^, d^, -a-dc \ - dt) , (28) 

because V can be written 5q|(- + 9qC-3c|q. The operator —a-dc is the generator of Larmor gyrations, 
as shown in [15]. It just an intrinsic definition of the usual dg. The set (28) is actually the natural 
basis for vector fields, which confirms the relevance of the basis (25) for 1-forms. 

Be careful that the chosen bases are not averages and they must be taken into account when 
computing averages or fluctuations of a quantity. For instance, it could seem that a vector (resp. 
a 1-form) with components (1,0,0 | ,0,0, | 0) is averaged, whereas it is not, since it is equal to 
c-V (resp. c-(iq). 



Then, equation (27) is easily computed in matrix form 






= Gi-cj-i -t-Fo -1- dSo 










= ( -BGl BG\ 1 








1 




+ ( V'2^iB V'2^iB(t> 1 








)iB{l + <t)^ 



(29) 



+ ( c-V5'o a-VSo b-V5o | d^So 9^5,, -a-d,So \ ) 

where, as usual, 1-forms are written as 1 * 7 matrices: each column is an equation to be solved for 
the freedoms Gi and 5*0 in such a way that Fq satisfies the desired requirements. 

The lowest-order Lagrange 2-form a;_i is linked to Larmor gyration, as appears in (24). It 
is not invertible. Only two components of Gi are involved in the equation: G^ and Gf; and 
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only two components of T can be controlled by these freedoms: T^ and F]^. As announced in the 
appendix, the inversion is possible only under some conditions on the term Rq = — Fq + Tq + dSo, 
and the solution, if it exists, is not unique. 

More precisely, for the averaging requirement (16), the condition of the right-hand side of 
equation (A2) being in the range of a;_i is satisfied, since the only fluctuating terms are in r;^' , 
which can precisely be controlled by the freedoms G^'^. The solution imposes the fluctuating part 
of G;^' , which is given by 

osc(G5) :=0, 

osc(G?):=^. (30) 

At this point, the average part of F^' remains free and must be identified by the secondary 
and tertiary requirements (17) and (19). The magnetic moment requirement (19) is not concerned 
here, since F is automatically zero. Last, for the requirement for the maximal reduction (17), 
only the components F^^' can be controlled, and setting them to zero imposes the average part of 

avg(G5):=0, 

avg(G?):=0. (31) 

Formulae (30)-(31) give the traditional (lowest-order) Larmor radius. 

All other components of the first-order transformation generator G^^ still remain undeter- 

mined. They embody the non-uniqueness of the matrix inverse ((x'_i)~^ and the corresponding 
freedom will be useful for the solvability conditions at the next order. As a consequence, there will 
be some order mixing: the components b, 0, /i, 6 of Gi will be determined at higher order, at the 
same time as the components c, a of G2. 

Last, the gauge function 5*0 can not be useful and is set to zero 

5_i = 0. 
The final zeroth-order reduced Lagrangian (29) writes 



Fo = A-dq + y/2fiBb(j)-dq - ^3(1 + (p'^)dt . (32) 

It is just the average of the zeroth-order initial Lagrangian Fq: Fq = avg(Fo). The zeroth-order 
reduced 2-form tJo is then 

dFo = Wo , 
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which justifies the notation Uq introduced in (23). 

It is easy to see that the matrix U-i + Uq is not invertible, for instance all its gyro-angle 
components are zero. So, the pivotal matrix for high orders Mqo mentioned in the appendix is not 
identified yet. 

C. Turning point: Order 1 

At the following order n = 1, the reduced Lagrangian is given by (15), which is rather written 
with the unknown Gi^2 on the left-hand side 

G2-6^-i + ^-(Co + 2cJo) = Ti - dSi . 

As announced in the appendix, the pivotal matrix is the set of uj-i and (wq + 2aJo)/2, acting 
on the set of unknown components of (G2, Gi). Again, it is not invertible; this corresponds to the 
case where the inversion is possible only under some integrability conditions on the right-hand side 
Fi — dSi, to which one must add the set of —oj^i and —{ojq + uJq)/! acting on the set of components 
of (G2, Gi) that are already known, i.e. on G^^' . And the solution, if it exists, is not unique. 

More precisely, using a matrix notation, grouping in the left-hand side only the terms with 
unknown components of (G2, Gi) gives 
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( n n rj I Tt T^, r* I H, 



( dcSi daSi dbSi I d^Si d^Si dgSi I ) 



(33) 



where the exponent indicates matrix transpose. The coefficient /21 is defined by /21 := — ^ — , 
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and J21 and K21 are defined the same way. Last, for shortness, we used the short-hands 



dc-.-- 

da-.-- 

df.-- 
df>:-- 



c-V, 
a-V, 
b-V, 
—a-dc 



Be careful, dc is different from dc- 

Let us have a word on the graphical presentation for these matrix products, because it may 
seem surprising at first glance and it will be frequently used. In principle, 1-forms are row-matrices, 
vectors are column-matrices, so that the pairing between them is just given by the matrix product. 
Now, if 2-forms are presented as a 2 * 2 matrix, then the vector implied in the left-pairing must be 
written as a row-matrix, for the pairing to be just the usual matrix product. In this paper, in order 
to save room for the editor, we wrote this row-matrix as the transpose of a column-matrix. By 
the way, it makes formulae easier to read, because each component of the vector is facing precisely 
the row of the matrix which it multiplies: for instance in the second term of equation (33), the 
third row GJ of the vector Gi multiplies the third row (— /21, K21, | — \/2fiB, — 2u ' ^ I ^bHo) 
of the matrix 2cZ;o + ^o which is on its right. So, this graphical presentation seems to be well-suited. 



Equation (33) can be simplified by removing the components that do not contribute 
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( i9c5i daSi dtSi I d^Si d^Si deSi \ ) . 



(34) 



This formula illustrates the typical form for r„ announced in the appendix. In the left-hand side, a 
matrix product involves the unknown components of (G„+i, G„, ...). In the right-hand side, there 
are two kinds of terms: the terms with the known components of (G„, G„_i, ...), and the terms 
involving the freedoms to be determined by the integrability conditions, i.e. r„ and Sn- 

The requirements concern the seven components of the reduced Lagrangian T^, which are 
ideally put to zero. The freedoms are embodied in six components of (G„+i, G^, ..•) and in the 
gauge function Sn- More precisely, the primary requirement (averaging reduction) means that 
the fluctuating part of the reduced Lagrangian must be zero: osc(r^) = 0; actually, the chosen 



24 



basis for 1-forms is not averaged, so that each component can not be averaged separately, e.g. 
neither b-Vb-a a-dq nor b-Vb-c c-dq are averages, but the sum of them b-Vb-dq is an average. 
The secondary requirement (magnetic moment reduction) means that the gyro-angle component 
of the Lagrangian must be the magnetic moment: T^ = JI, and r„ = for n ^ 1. The tertiary 
requirement (maximal reduction) means that the average part of the reduced Lagrangian should 
be zero as well: avg(r„) = (except Ti). When integrability conditions can not be solved for so 
strong requirements, the tertiary requirement is released, but as little as possible, and one sets to 
zero as many components r„ as possible. 

Let us solve equation (34). The pivotal matrix is not invertible. It has six columns and seven 
rows; the columns (p and fi are not linearly independent; in addition, its column for 6 is zero, 
which means that Ti can not be controlled by the freedoms (G2, Gi). So, the existence of solution 
is submitted to integrability conditions (by which the freedoms Si and Ti can be constrained), 
and then the existing solutions are not unique. 

The integrability conditions are automatically satisfied. The overall equation for the column (p 
will be automatically zero, because the procedure will imply G^ = 0, and 5*1 = 0. 
As for the column 9, it writes: 

= -^Gl+Tl-deSi. 

The averaging requirement means that dgSi =0, which implies 

osc(S'2) = . 

The average part of the equation then writes 

Tl = ^Gl = ^^, (35) 

which makes the automatically verified for this component. In fact, if /x had not been chosen as a 
preliminary coordinate, it is here that it could be identified. 

As a side comment, it might seem that the right-hand side of equation (35) should be JI instead 
of fi, because of formula (19). However, in the derivation, all expressions for F are functions of 
yU. In the reduced system, the corresponding expressions will be the same functions evaluated on 
the reduced coordinate JL, as is well emphasized in [5]. So, formula (35) means that the first-order 
reduced Lagrangian Li will actually have its 9 component equal to JL. It is why this result verifies 
the requirement (19) on the magnetic moment. We will not insist more on this point. 

With the result (35), the reduced Lagrangian will contain the term fi59, implying the presence 
of the vector Rg, which is not determined. This feature already appeared in the computation of 
ojQ, but the vector R^ involved in 59 = —{dc — (iq-Vc)-a came from a term — dc-a, so that the 
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overall contribution of the vector R^ actually cancelled. Just the same way now, the term fj,60 is 
required to come from a total contribution 

— /ua-dc = fi{59 — dq-Hg) . 

This imposes a non-zero contribution for T : 

pq — f"' - fidq-Kg . 

For a maximal reduction, minimizing T then means minimizing V^ (and ideally setting it to zero). 



For the remaining 5 columns, the pivotal 5*5 matrix is invertible for the five unknowns 
G2, G2, G^, G^, G]* j , as is clear through the following argument: 

- The freedom G^ controls the column for Ti, since the coefficient —\/2fiB = —psimp is 
invertible and no other unknown component of (G2,Gi) appears in this column. This does not 
determines fully G^, since the freedom avg(S'i) appears in the column, and avg(S'i) is to be 
identified by the column Ti , in which G^ appears again. Thus, the set of ( F^^ , F^^ 1 can be 
considered as a coupled set of equations for I GJ, avg(S'i) I . But it is solvable; it implies that 



avg(S'i) = /C((/>^, q) , (36) 

is an arbitrary function of (f>^/JI and q, and that 

Gi = 7^V^g(^i)- (37) 

- In a similar way, the set of (Fj^ ,F]^ j is a coupled set of equations for (Gj'jG^'j, which is 
solvable. 

- Indeed, the freedom G^ controls the column for F^ , since the coefficient \/2fiB is invertible. 
The solution for Gj" is then parametrized by Gj*, which is still unknown, but appears in this 
column. 

- Then the freedom G^* controls the column for Ti = Hi, because when inserting the solution 
for G^, the coefficient of G^* becomes just B, which is invertible. 

- The freedom G| controls the column for F^ , since the coefficient —B is invertible and no 
other unknown component of (G2,Gi) appears in this column (because now G^ and Gj* are not 
unknowns any more). 

- The freedom G| controls the column for F^ , since the coefficient B is invertible. The solution 
for G2 is then parametrized by G^, which is still unknown, but appears in this column. 

- Then, all the components of F'^^ remain free and are set to zero, so that the optimal requirements 
are fulfilled. The freedom /C in avg(S'i) (which is a parameter in the formulae obtained for G^, 
Gj", Gj*, Gl, and G2 can not be useful to improve anything and is set to zero: 

avg(5i)=0. (38) 
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- The freedom G^ is still undetermined, it embodies the non-uniqueness of the solution implied 
by the pivotal matrix being not invertible. This freedom will be useful for the requirements at 
the following order. As a consequence, the order mixing will not include only two orders, but 
three of them, and the pivotal matrix will act on components of {Gn+i, G„, G„_i). In addition, 
the orders will not be solved independently, since the unknown G^ will be identified at order 
r2, whereas it already appeared in the equations at order Fi, so that it is a parameter in the 
expression computed for G|. 

This procedure shows that the average and fluctuating parts of the equations are dealt with 
the same way, because the equations for G„ is algebraic. This is very different from the minimal 
guiding-center reduction by Lie transforming the equation of motion, whose equation relies on the 
operator de, and which easily controls the fluctuating part of the equation, but involves secular 
differential equations for the average part of the equation [12]. 

At that point, it seems that the order Fi has been completed: it is indeed completely satisfactory 
in itself, since all of the requirements are perfectly fulfilled, with the resulting reduced Lagrangian 
Fi = —fia-dc. However, the procedure will have to be slightly changed, because at the following 
order the secondary requirement for F2 can be controlled by no higher-order freedom; it can be 
controlled only by avg(Gj'), which was already determined above by the tertiary requirement for 
T^. These two requirements can not be simultaneously fulfilled and one of them has to be dropped. 

It is here that the requirements are not dealt with in the same way: as one of them must be 
dropped, the choice is imposed by the hierarchy and the secondary requirement must be preferred 
to the tertiary one. 

As a consequence, avg(G]') must be let free at first order. It remains a parameter in G^ and 
in Hi. Another consequence is that the tertiary requirement for F^ has been lost, and Hi has a 
non-zero value. Actually, the equation for Hi was coupled with the one for F^ , and the non-zero 
term can be put in either of these components of F. 

One can consider recovering a zero value for this term by using the freedom /C available in 
avg(S'i), which had been arbitrarily fixed to zero in the process above in equation (38). But the 
corresponding equation for /C has no solution. Indeed, requiring avg(i7i) = is an equation for 
/C, which is the only available freedom 

[^d, + ^(1 + e)d^]JC = -^avg(GO. (39) 

With the expression (53) for avg(Gj'), it can be studied by expansion in series }C{(j)y/Jl) = 
^^/Cfc(0-y//i)'^. Expanding the right- and left-hand side of the equation and equating the co- 
efficient of the same orders in (j) and /i gives a non-solvable equation. For instance, the coefficient 
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of order (p fJ, implies the following equation 



V-b/C, 



-b-V X b, 



(40) 



which has no solution for a general magnetic geometry. So, the only available freedom can not be 



T^'b 



used to obtain the full reduction F^ = and Hi = 0. One of those components has to be non-zero. 



Computing explicitly the expressions for the solution at order Fi according to the procedure 
identified above gives the following results 



osc(5i) = G^ = 
osc(Gf) = ^(l + 02 

osc(G5') = ^^^^^ 



9^^^^ + (hab'b 



B 



Ba 
B 



^cba±abc _ 202ab'b 



(41) 



(42) 



^^g(Gr) = 7^ [^1 + M^ 



T/b 



, cba — abc 



2^ 



avg(G^; 



Hi = y/2iiB<t) Fi + ^l 



T/b 



, cba — abc 



r;a _ _M 



B 



Gl 



jj 7cba— 9abc 
^2flBG^ 



b'^aWb 



+ i?avg(GO 

+ vW-g(GO 



^aca , 



(43) 



where following Littlejohn, a condensate notation is used for gradients: the curved prime is used 
to indicate gradients of the magnetic field, and the short overbar over a vector c or a indicates 
the matrix transpose (for the euclidean scalar product), so that cb'a := a-Vb-c and B^a := a-VB. 
This notation comes from [6], slightly adapted to make it more explicit, in order to fit in with 
higher-order expressions [12]. Be careful, the straight prime and the long overline are different and 
do not indicate gradients or matrix transpose, e.g. in F . 
Finally, the first-order reduced Lagrangian writes 



T^'b, 



Fi = —fia-dc + Ti b-(iq — Hidt . 



(44) 



T/b 



In the results above, the term F^ was kept free in order to include both of the choices considered 
above. As F^^ is free, the natural choice for a maximal reduction is F^^ = 0; then the reduced 
Hamiltonian is non-zero: 



H^ 



^ 



V^cp 



cba— abc 



+ i?avg(G^; 



(45) 



Alternatively, Hi can be set to zero, by choosing 

r'''' B ^„„('r~i/^\ ,,cba-abc 
^1 = -^^^^^^g(Gl) - ^ 2 • 

This last choice is possible only if the inversion of (p does not cause a singularity, i.e. if avg(Gj') 
has no overall contribution of order zero in cp, which will have to be verified when avg(Gj') is 
identified (see equation (53)). 
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The first-order Lagrangian (44) induces the following first-order reduced Lagrange matrix, which 
will be part of the pivotal matrix at higher orders: 



wi 



/ 










dcHi\ 




^1 








daHl 






-dA' 


-d^T[' 


dbHi 





d^T[' 








d^Hi 





a^rf 





1 


dpMi 











-1 





\-dcHi 


-daHl -dtHl 


-84, Hi 


-dfjii 


/ 



(46) 



where the matrix a;^''^ is defined by 



dci-oj^'^-dci := (iq- - ^Vb-a A cb' - ( V x (F^ b 



T/b, 



dq. 



At the end of the first-order analysis, three freedoms remain: G^, which is a parameter in G|; 
G^, which is a parameter in Gj* , in G| and in Hi; and either F^ or Hi, which is a parameter in 
the other one. 

The results (41)-(43) have physical implications. For instance, Gg^ determines (together with 
Gi) the perpendicular component of the second-order Larmor radius (q — q)2 = G2 — 2Gf -^zG^. 
Its averaged contribution will imply that the Larmor radius is not a pure fluctuation, and later on, 
a non-zero G2 will be obtained, which will imply that the Larmor radius is not purely transverse 
to the magnetic field. 

Those features are well known in guiding-center works, and we do not insist on them. Here, 
we focus on the mechanism of the reduction, to show how it can be performed to arbitrary order 
in the Larmor radius using gauge-independent coordinates for the gyro-angle, and why it can be 
considered as a maximal reduction. 



This point is the turning point of all the reduction: the matrix uj-i +c<7o +ZI;i is invertible. 

Thus, in the induction for high orders, the pivotal matrix Mqo will be the set of I cJ_i,a;0)^i 
acting on the set of unknown components of (G„_|-i, G„, G„_i). 

The first stage of the reduction is achieved, rn, = 1, which means that ric ^ 2nc + 2 = 4. 
The algorithm for the derivation at high orders (third stage mentioned in the appendix) can be 
identified by now, but it will be efficient only for orders n ^ 4. The intermediate orders (named 
"second stage" in the appendix) must be studied separately. 

In order to introduce the derivation order by order, we first go through the second stage and 
postpone the third stage, but it is important to notice that this last is independent of the second 
stage and could be studied before. Especially, all the high-orders algorithm relies on the matrix 



M, 



together with the differential operators involved in dSn- They are already known by now. 



and are determined by the choices that have been made previously, and mainly by the choices at 
order 1. 
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D. Core of the second stage: order 2 

At the following order n = 2, the reduced Lagrangian is given by (15) 

fs = G3-c^_i + Gs-ZJo + ^•d[Gr(3ZI;o + 2Gjq)] + dS2 ■ (47) 

At the previous orders, it appeared that for the algebraic part of the equations, the requirements 
on the average Lagrangian were dealt with exactly the same way as the requirements on the 
fluctuating Lagrangian. So, they will not be studied separately. Only the integrability condition 
will restore a difference between them for some of the unknowns. 

At this point, the scheme for the unknown Gj is not purely algebraical: Gi is still not completely 
known, and it is involved in a first-order differential non- linear equation because of the term ic-^^dX, 
with 

A:=iGr(3tJo + 2wo). 

In equation (47) , the exterior derivative must be computed for the 1-form A that is not explicitly 
known yet. Unlike in previous subsections, an explicit computation is not possible, and an abstract 
formula must be used; care must be taken that the usual formula (10) for exterior derivative can 
not be used, because it is valid only when the basis is composed of closed 1-forms. Otherwise, it 
is replaced by the more general formula: 

d-f = d{-f^ej) = d{j^) A Bj + -f^dej (48) 

= ei{di^^ - djY)ej + -f^dej , 

for any 1-form 7, with components A-^ in the basis Bj. 

In formula (48), two operations are to be identified: the action of the exterior derivative on 
scalar functions d'j^ expressed in the chosen basis for 1-forms, and the exterior derivative of the 
basis dej. 

For the exterior derivative on scalar functions d'j^ , it writes as usual 

d{j^) A ej = ei{da^ - djf)ej , 

provided the differential operators di are given by the dual basis (28) to the chosen basis (25) of 
1 forms ej. 

As for the exterior derivatives of the basis dej, they are easily computed as: 

d(#) = didfi) = 
d{b-dq) = — dq-(V x b) x dq 
d{c-dci) = — (iq-(V x c) x dq — 50 A a-dq 
d{a-dq) = — dq-(V x a) x dq + 69 A c-dq 

d{59) = -ab'dq A cb'dq + d(ac') A dq . 
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In d59, the derivative a-c' appears. Such terms should be avoided, since they are not determined, 
as shown by (6). But there is no trouble here, since in all the 1-forms in this derivation, the term 
eg = —a-{dc — dq-Vc) comes from a-dc = — ei + dq-Vc-a. The exterior derivative of the second 
term will generates d{a-c) A dq, which will automatically cancel the term R^ coming from d69. 

As a result, there is no need to compute those terms. Computing the exterior derivative of a 
1-form 7 can be made according to the following procedure: write 7 = 7' — 7 R^-fiq, where now 
the spatial components j"^-dq^ do not involve ac'. Then apply formula (48) to 7' and to — 7 R^-dq. 
The cancellation of the terms containing d(R,g)- A dq gives the resulting formula: 

d7 = ei{da'^ - djj")ej + 7'^d'e, - eida'^ A Rg-dq , (49) 

in which d'e-i = de-i — 5 d{a-d) A dq, where 5 is the Kronecker delta, which means that d'ei is 
exactly dei but without the problematic term in d{50). 

With formula (49), the quadratic term in Gi is found to have its momentum components linear 
in the unknowns ( G^, avg(Gj') I , and differential only for G^ 

^iiG^dxY = -G\d^[2G\Yig-a - 2G?] (50) 

&{iG,d\Y = -G\d^,[2G\Yig-a - 2G?] + AG\Yig-a - 4G? 
6(iGidA)^ = -G\de [2G\Yig-a - 2G?] + 6G^ + Gt6J322 , 

where the coefficient J322 is defined by 

J322 := f (3J + 2.1 + 2'Rg-a) = fi U(cb'a - ab'c) + ^ 

Notice that it does not depend on R^, precisely because the term 2ac'a coming from formula (48) 
cancels the corresponding term in J. 

The differential equation for G^ does not make things much more complicated, since it can 
be easily solved, e.g. by expansion in ^Jji and (p. Alternatively, a trick can be used to make the 
scheme purely algebraical [22]; it is not essential for the derivation, but we will use it because it 
simplifies much the explanations. 

The idea is to notice that in this case, the only differential operators involved come from the 
second term of the exterior derivative {diX^ — djX'') dy"^ fS> dy^ , i.e. they are involved in expressions 
that write —G\djX^dy^ = —G\dX^. In addition, in the sum over the index i, only one of the terms 
involves a differential operator d acting on the unknown G^, namely the term with i = a. An 
integration by parts over this term can transfer the differential operator over the pre- factor Gf, 
which is already known. This is a way to make the equation algebraic. 
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This integration by parts is justified by the gauge function. The equation for r2 can be added 
a total derivative, which can be chosen (i(Gf A^) and extracted from dS2 by ^5*2 = dS'2 + (i(Gf A^): 

-GldX^ + dS2 = -GldX^ + dS'2 + d{GlX^) = dS'^ + X^dGl , 

and this formula is not differential any more, but algebraic for G^, which is contained in A^. 



Then, equation (47) becomes 



r2 = Ga-cj-i + G2-a;o + A + dS'2 , 



(51) 



with 



A:=iG,dX + diG^X^) 

= ( Ac Aa Ab I I At ) 



\G1 



T 







ac'a J322 
1 
0-10 







where the momentum components are written in matrix form because they will determine the 
unknowns (Gf , avg(Gj')) . 

Now, the induction relation just relies on a pivotal matrix M2, which appears in the set of 



fG3-a;_i,G2-a;o,A 



in which a linear algebraic operator (a matrix) acts on the unknown components of (G3, G2, Gi). 

It is invertible in the sense that it determines six unknown components of (G3, G2, Gi), which 
is the maximum that can be done at each order. Remember the matrix can not be fully invertible, 
since the transformation is time-independent, so that G„ is 6-dimensional, whereas the matrix 
has value in the 7-dimensional space (q, p,t); and by the way, the 7-dimensional matrix is anti- 
symmetric, hence not invertible. The seventh requirement is to be provided by the gauge function 
^2, which is the only integrability condition involved at this order. 

More precisely, removing all the coefficients that do not contribute, as was done in (34), equation 



(51) becomes 



T 



+ 



+ 



+ 



Gl 



Gl 



T 







in 



I rt 



I H2 



{dcS'2 daS'2 dtS^ I 84,3^ df,S'^ dgS'2 I 
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(52) 



where the left-hand side contains just the terms involved in the matrix inversion to determine the 
unknown components of (G3, G2, Gi). 



Formula (52) is similar to (34), and the same comments can be done as at the previous order, 
in the two paragraphs following formula (34). 

To solve the equation, an analysis similar to the one at the previous order leads to the procedure 
summarized in the following tabular, where each row corresponds to one of the equations equation. 
The component r2 of the reduced Lagrangian involved in the corresponding equation is indicated 
in the first column; the unknown which controls the equation and permits r2 = is indicated in 
the second column; the coefficient to be inverted is indicated in the third column. 



Equation 


Unknown Coefficient 


avg(r;) 


avg(GO 


1 


osc(r2) 


0SC(S'2) 


de 


^2 


G^ 


-yJ2llB 


^2 


G? 


-1 




Gt 


V2/uS 


H2 


G^ 


B 


^2 


G| 


-B 


n 


G^ 


B 



A few comments are in place. As had been announced in the derivation of formula (42), the new 

n 

feature is that the equation for avg(r2) can be controlled only by the average first-order magnetic 
moment avg(Gj'). It is why it was not available at the previous order. 

The coefficient de is an operator, but it is invertible over gyro-fluctuations. 
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The set of ( r2 , r2 j is a coupled set of equations for ( G2 , Gg j , but each of the unknowns can 
be assigned to one of the equation because the system can be solved in the following way. The 



freedom G2 controls the column for r2, since the coefficient \J2^B is invertible. The solution 
for G2 is parametrized by G2, which is still unknown, but appears in this equation. Then, the 
freedom G2 controls the column for r2 = Hi, because when inserting the solution for G2, the 
coefficient of G2 becomes B, which is invertible. 

At the end, the reduced Lagrangian r2 is free and can be set to zero, as required for the 
maximal reduction. The freedom avg(S'2) (which is a parameter in the formulae obtained for G^, 
G2, Gg, G2, G|, and G3) can not be useful and is set to zero. 

In a similar way as at the previous order, at that point, the order r2 is completely satisfactory 
in itself, since the reduced Lagrangian has been fully reduced r2 = 0. 

However, the procedure will have to be changed, because at the following order the secondary 
requirement for r3 can be controlled only by avg(G2), which is therefore not available to get the 
tertiary requirement for r2. So, this last requirement has to be dropped. 

Accordingly, avg(G2) remains free at this order, and it is a parameter in G| and in r2. This 
last has a non-zero value, but its equation was coupled with the one for r2, and the non-zero term 
can be put in either of these components of r2. 

One can consider recovering a zero value for this term by using the freedom avg(S'2), whose 
value had been arbitrarily fixed to zero in the process above. 

Indeed, if avg(S'„) is let as a free parameter in G2, G^ and Gg, then when replacing these 
variables by their expression, the requirement avg(i/2) = becomes an equation for avg(S'2). 
Unfortunately, this equation is not easily studied. 

As a first attempt, S2 can be chosen such that it is absent from G^, as it was done in equations 
(36)- (37). Then it is easy to see that the equation for avg(S'2) will have the same structure as 
equation (39). This equation may not be integrable, as it was the case for equation (39). In 
this case, one should relax the condition for avg(S'2) to be absent from G^, and the differential 
equation for avg(S'2) could be more difficult to study, because G^ is involved in the 1-form A in 
equation (52) in a rather intricate way. 
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Applying the procedure identified above gives 
avg(G^) = ^^f5(/.(ab'c - cb'a 
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(53) 



(54) 



(55) 



where 
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With the results (53)-(55), the parameters involved in formulae (41)-(43) can be made explicit: 



-g(Gf ) = 7^ 



rJ + ^i±Ml(cb'a-ab'c) 



Hi = y/24xB(j) 



— b cb'a-ab'c 



/-la _ _M 
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(56) 



(57) 



The reduced first-order Hamiltonian for the choice Fg = is 



Hi = ^J.yj2llB4)~^ 



The reverse choice Hi = is possible, since its existence condition in equation (45) is satisfied, as 
is clear in (53). It corresponds to a component F^ of the reduced Lagrangian given by 






(58) 



which is regular in (^ = 0, as expected. 

In formulae (53)-(57), the components Gi and G2 have been completely computed and simpli- 
fied for comparison with previous results, because it is the point where usual derivations stop. 
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For the other components G2, G3, G3, and i^a, formulae (53)-(57) are exphcit solutions. 
Their right-hand side involves only known quantities (or quantities that are free parameters 
for these relations), but it has not been not expanded and simplified. This can can be done 
in a straightforward way just by computing explicitly the terms involved, but we will not 
pursue in that direction, since the calculation for the Ac,a,b is lengthy, and useless for our 
purpose, which is just to show how the procedure can be performed to arbitrary order. In addi- 
tion, they are the topic of a work by the authors of [18], and were already partly introduced in [19]. 



In the results, the term r2 was kept free to include both of the choices considered above: setting 
the non-zero term in H2 just means choosing r2 = 0. The other choice H2 = corresponds to 

r5 = avg[R2^ - ^^{i3G^ + R* + G2^1 + <P')iiB'b} 

This last choice is possible only if the term inside the curled parentheses has no overall contribution 
of order zero in (j). 

Also avg(S'2) was kept free, because one can consider using it to obtain the full reduction with 
both r2 = and H2 = 0: it will imply a differential equation for avg(5'2), which might not be 
integrable, just as happened at the previous order in equation (39), but it might be integrable, or 
partly integrable, and then provide the full reduction, or at least a stronger reduction. Otherwise, 
it can be set to zero. 

At the end of the analysis at order n = 2, it remains one unknown, one freedom, and a binary 
choice. The unknown is avg(G2); it is a parameter in G2 and in H2 (or r2); it will be determined 
at the following order. The freedom is avg(S'2), which remains free, but could not be used to 
improve the reduction. The binary choice is that either r2 or H2 is set to zero, the other is 
computed accordingly. This is very similar to what had occurred at the previous order, but now, 
the unknown G2 does not appear as a parameter, since it is not at all involved at this order, as is 
clear in equation (52). 



E. End of the second stage: order 3 

Let us turn now to the following order T^. The equation writes (15) 
Tg = G4-c^-i + Ga-ZJo + G2-tJi - ^%^r_i 

The unknowns are components of (G4, G3, G2). The pivotal matrix is not Mqo, i-e. not just given 
by (a;_i,ZJo,^i), because of the correcting term —- — %-^T^i. This term might make the derivation 



more difficult, since it is not algebraic, but non-linear and differential for G2 
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■^■d{G2-dT^i] 



• [d{ - B{Glc) + 5(Gla)}- A dq] 



It can be written in matrix form 
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where the matrices Mij have obvious definitions, and are independent of G2' • 

This is enough to show that this additional term can be transferred into the right-hand side, 

i.e. it involves only terms that are already known at each step of the computation. 

When computing r3 for the unknowns G2, avg(G2), G3 and 030(53), the only components of 

G2 involved in the correcting term are in G2 , which is already known at that point. Then, when 

computing Tg' for the unknowns G3, G4, G4, and osc(G3) all the components of G2 are involved 

in the correcting term, but they are all known at that point. 

As a consequence, the correcting term can be put in the right-hand side of the equation, and 

the pivotal matrix is actually Mqo, i-e. the set of (a;_i,c<7o,^i) acting on the unknown components 

of (G4,G3,G2). This means that Uc = 3, and the order T3 can be included in the third stage, 

with all higher orders, which is studied in the following section. 

F. Third stage: algorithmic orders 4 and higher 

Now, the second stage of the method mentioned in appendix is ended and the third stage is 
beginning, which means that the matrix to be inverted is always the same at any order n ^ 3, and 
it is indeed invertible. So, the reduction can be performed to arbitrary order. The only possible 
complication comes from the integrability condition for the gauge-function Sn (and possibly r„), 
but after settling it, the reduction process becomes fully algorithmic and unique. 

1. Equation and algorithm 
This is proven by induction. Let us suppose that at some order n ^ 3, the set of unknowns are 



gn := r„,5„,G^i,avg(G::_i),oso(G^), 



G p0 P 



(59) 



nt ^-'n) ^^71+1) ^-"'ra+1 ; ' 



which means that before that order, all lower-order quantities gj<„ are already determined, and 
that after that order, all higher-order quantities gi>n will remain free parameters. This assumption 
is verified at order n = 3, which initializes the induction. As announced in the appendix, we have 
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included the reduced Lagrangian r„ in the vector g„, because some components of r„ can not be 
set to zero and have to be computed in the process. 
The reduced Lagrangian is given by equation (A6) 

Gn+l'^-l + Gn'^0 + Gin-l-iOl = r„ — R„ — dSn , 

where R„ indicates all terms of ...e^e^F that are of order n but do not involve Gn+i, G„ or 

Gn-l'- 



R„ := [...e ^e ^r]n - [Gn+yUJ-l + Gn-UJo + Gn-VU}i 



in which the index n indicates the term of order n. 



Then, the pivotal matrix Mqo is the set of 



acting on the unknown components of (G„+i, G^, G„_i). 

It is invertible in the sense that it determines six unknown components of (G„+i, G„, G„_i), 
which is the maximum that can be done at each order. The last requirement is provided by the 
integr ability condition for the gauge function S'„. 

More precisely, grouping as usual in the left-hand side only the terms with unknown compo- 
nents of (G„+i, G„, G„„i), and removing all the coefficients that do not contribute, the induction 
relation for r„ writes 
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(60) 



V r„ r„ Fjj I r„ r„ r„ | //„ ) 

{^dcSn daSn dt,Sn \ d^Sn d^Sn deSn \ ) 
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where the terms involving known components of {Gn+i, Gn, G„_i) have been grouped with R„: 



R„ — Rr 



(61) 
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Again, the same comments as the ones after equation (34) are in place. Also, when computing 
R„, formula (49) is to be used to account for the derivative of the chosen basis of 1-forms and for 



the cancellation of the derivatives of R^ . 



When solving the induction relation, the mechanism is the same as at order Fq for r„' , and the 
same as at order Fi for r„' ' and osc(r„). In addition, the new feature is the presence of cJi for 
r„ and avg(r„), but the mechanism is similar to what happens at order r2, in the sense that the 
pivotal coefficients are the same. Mainly, the procedure relies on three conjugation-like relations: 
(c, a) are conjugated for oj^i; {^,0) are half-conjugated for ZJi, in the sense that the structure is 
quarter-canonical; and (b, 0) are half-conjugated for the symplectic part of cJq. 

The same procedure can be applied, which is reminded in the following tabular. 



Equation 


Unknown 


Coefficient 


avg(r^) 


avg(Gt:„i) 


1 


osc 


(It.) 


osc(5;) 


de 


'- n 




G^ 


-y/2^xB 






Gn-l 


-1 


■"- n 






y/2iiB 


Hfi 






B 


K 




^n+l 


-B 


r 




^n+l 


B 



Then, all the components of r„ remain free and are set to zero, so that the optimal requirements 
are fulfilled. The freedom avg(S'„), which is a parameter in the formulae obtained for G„_^, G„, 
Gn, Gn, G^n+i^ ^^^^ G'^n+1^ ^^^ ^^^ t)e uscful and can be set to zero. 



In the same way as at orders 1 and 2, avg(Gn) must remain free at that order, because it will 

n n 

be needed to solve T^^^ at the following order, just as avg(G(:^_]^) is needed here to solve r„. So, 
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one can not have the reduced Lagrangian fuhy simphfied r„ = 0. One of its component remains 
uncontrolled, either r„ or r„. 

One can consider recovering a zero value for this component by using the freedom avg(S'„), 
whose value had been arbitrarily fixed to zero in the process above. Then, when computing 
G^_i, Gjj and Gn, the average gauge function avg(S'„) remains a free parameter. When replacing 
these variables by their expressions, the equation avg(i?„) = becomes a differential equation for 
avg(5„), whose structure is 

[^db + ^(1 + <^2)a^]avg(5„) = o.t. , (62) 

where o.t. means other terms that can be explicitly computed. This resembles equation (39), 
but here, the condition (36) has not been required in the process, contrary to what happened in 
previous orders. The reason is that now r„ is controlled by G„_]^, which has no effect on i7„. 
The integrability of equation (62) will depend on the right-hand side and must be studied at each 
order; a priori, it is not guaranteed, since obstructions such as (40) are possible. 

So, a systematic procedure can not use the freedom avg(S'„) to get the additional require- 
ment Hn = 0, which must be dropped. Then the freedom avg(S'„) is useless and can be set to zero. 

At the end of the n-th-order analysis, exactly all of the unknowns g„ have been determined. All 
the components of (G„+i, G„, G„_i) that remain unknown are in gi>n. Yet, this does not allow 
us to conclude that the induction is proven, because the unknown avg(Gn) already appeared as a 
parameter in G^_|_^ and in Hi; hence it is not completely free, whereas the induction assumes it 
is free (independent of the quantities gj^n), since it is in gn+i; it will be determined at the next 
order, and this could imply coupled equations, whose solvability is to be verified. 

However, avg(Gra) will be computed in equation for avg(r„_^]^), which corresponds to the column 
r„ in equation (60) at the next order, and does not involve any of the parameter-dependent 
quantities. Thus, there is no coupled equations, and the solutions are indeed explicit. This 
terminates the proof of the induction: the reduction can be performed to arbitrary order in e. 

Notice that here, the induction relation is considered from the point of view of r„; this caused 
an interlocking between the orders, where in the solution at each order, a parameter is involved, 
which will be identified at the next order, when computing r„_,_|. To avoid this interlocking 
phenomenon, it is possible to consider the induction relation from the point of view of ( r„' , r„_|_^ I . 



The drawback would be that when solving the equations for r„_,_^, one would begin the heavy 
computations for R[i_j,i, which are involved in r„_|_^, hence at the next order. In computations by 
hands, this can be a trouble, but when using computer-assisted computations, this is no trouble 
and it would probably be a more relevant choice. 
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2. Explicit induction relations 

The argument above emphasizes the distinction to be made between four kinds of quantities. 
First, some of the quantities are already known before the computation at order n, namely gA;<n- 

Second, for i {b, t\ the components r„ have not been computed yet, but they can be excluded 
both from the unknowns and from the parameters, since the algorithm shows that before any 
computation, they are already known to be zero for all n ^ 3, to fulfil the requirements (16)-(19) 

for r. 

Third, some quantities are not known yet, and will be determined after the matrix inversion, 
namely 



(gn)oo := (^avg(G^_i),osc(S'„) | (63) 

*^n' '^n-l' '^n+l? ^i+1 I 

avg(G^),osc(G^),osc(G^),;H^, 

in which a vertical dash | was written at the places where a vertical line will be written in the 
matrix Mqo below. 

Last, other quantities are not known and will remain free after the matrix inversion, namely r„, 
avg(5n) and gfc>n; the variables gfe>n will be determined at higher order, but one of its component, 
avg(Gn), is already involved in the equations at order n and behaves as a parameter in this matrix 
inversion. So, the parameters are 

(g„)„ := (avg(G^),avg(5„),r^) . (64) 

Notice that avg(G^) is included in the parameters (gra)^ even if is not an element of g„ but of gn+i- 

With the procedure above, the left-hand side of equation (60) can be written as just a matrix 
product, provided the pivotal matrix is extended, to act on all the quantities (gn)oo to be computed 
at this order, even the reduced Hamiltonian i7„ and the gauge function Sn- To include the gauge 
function Sn in the vector which is acted upon by the matrix, some coefficients in the matrix must 
be operators, and the equation will be transposed, so that the operators act on their right. For 
clarity, the order of the columns is chosen to fit with the steps of the algorithm 

(avg(r^),osc(rfj I rtr^,r^,r^ I 

avg(r„), osc(r„), osc(r„), avg(r„) 

so that the equations are solved one after the other in order. A vertical dash | was written at the 
places where a horizontal line will be written in the matrix Mqo below. 
With this order for the rows and for the columns, the equation becomes 

Moo-(gn)^ + M«-(g„)^ + R'^ = 0. (65) 
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Here, the rows are the equations to be solved, corresponding to the (re-ordered) columns of equation 
(60). The first term involves exactly the unknown quantities to be identified at this order. The 
second term involves exactly the parameters involved at this order but which will remain free at the 
end of this order. The third term involves only quantities that are already known at the beginning 
of this order. 

The matrices are given by 



and 



M. 



1 1 
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de 
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-V2f^B 






d. 


-^yi-l 






dc 


-/ 


-B 




da 
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B 




avg^^Ti db 






V2/ii? 


oscd^{T^) db 






V2^<^y| 


oscdi,(Hi) 


-HoV-b 




2/UB0 ^ 


\avgdf,(Hi) 


-HoV-b 




2iiB(t) -l) 



M. 




(66) 



in which the zeros were written only in the empty rows, for clarity, and we used that V-B 
implies ^ = -V-b. 



In the matrix Mqo, grouping together the eighth and ninth rows and columns produces a 9 * 9 
lower triangular matrix, whose coefficients on the diagonal are invertible, since the operator — a-9c 
is invertible on gyro-fluctuations. The eighth and ninth rows and columns have been grouped 
together because they constitute an invertible 2*2 matrix on the diagonal. A convenient way to 
invert the resulting 9*9 matrix is to separate its diagonal terms: 



M' + D, 
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where M' has null diagonal and D is purely diagonal. Then the equation can be written 

-D.(g„)^ = M'^.(g„)^ + M„.(g„)^ + R'^ . 
The solution of equation (65) is then 

(g„)^ = (-D)-i. [M'^.(g„)^ + M«.(g„)^ + R'^] , (67) 

where (— D)~^ is a diagonal matrix with coefficients 

(-D)-i:=Diag(-l,-(a,)-M^, 1,^,-^1 

in which D^ is the inverse matrix for the coupled system (the eighth diagonal term of the 9*9 
matrix mentioned above) 



The operator dg is the gyro-integral operator. It can be computed without introducing any 
gyro-gauge, with the intrinsic calculus introduced in [15], or with the matrix calculus introduced 
in [12]. Also, the coordinate 9 can be used as an intermediate quantity for this computation, which 
is made at constant q, so that the presence of a gauge (only for the intermediate computation) is 
of no consequence; then dg is the primitive with respect to 9 such that its gyro-average is zero. 
Over the Fourier modes A; / (i.e. over gyro-fluctuations), it is the operator ^. 

Even if (gra)^ appears in its right-hand side, formula (67) is an explicit solution for the induction 
relation: in the matrix M^, all the coefficients on the diagonal or above it are zero, so that when 
computing the unknowns one after the other starting from the left, each of them is computed as a 
function of previously computed quantities, i.e. the right-hand side contains only known quantities 
or parameters, but none of the remaining unknowns. Alternatively, the induction can be solved 
using a standard matrix inverse 

(gn)^ = -M^^-[M„.(g„)^ + R'^' 

with M^^ easily computed from (66), but the coefficients are more complicated, and practical 
computations usually follow the procedure of formula (67). 

For the solution g„, formula (67) or (68) must be completed by the following relations for the 
trivial components of r„: 

r^ = for ani0{b,t}. 
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and by the determination of the parameters (grOa- 



The first parameter avg(Gn) will be determined at the following order, in an equation that 
does not involve Gh and i7„, so that there are not coupled equations between the orders. 

The second parameter r„ is either put to zero or adjusted so as to make the reduced Hamiltonian 
Hn zero by the relation 

r' =avg [r^.^ + G;:_i5^r;' + dA (69) 



k^[G':^_^d^H, - G^^HoV-b + SG^ + R:*} 



This last choice is possible only if the term inside the parentheses has no overall contribution of 
order zero in (p. 

The last parameter avg(S'„) is determined by equation (62), in order to make both T^ and Hn 
zero, when the equation is integrable. Otherwise, it is determined to cancel as many terms of r„ 
and Hfi as possible, or it can be set to zero for simplicity. 

Accordingly, at the end of each order n ^ 3, the situation is the same as at the end of order 
2, with the presence of one parameter avg(Gn), of one binary choice between r„ and Hn, and 
of one free variable avg(S'„). When the integrability condition can be satisfied, the reduction of 
the Lagrangian r„ is complete and the solution is defined to within an element in the kernel of 
the operator (62). Otherwise, avg(Gn) is generally set to zero, and after the binary choice the 
transformation is unique, but on the whole there are two maximal reductions. As announced in 
the appendix, the unicity of the transformation is determined by the integrability condition for 
Sn, and possibly by an additional criterion for simplicity. 

IV. COMPARISON WITH PREVIOUS WORKS 
A. Agreeing results 

Computations of the previous section for the guiding-center transformation and reduced Hamil- 
tonian can be summarized by formulae (30), (41)-(43) and (53)-(57). As for the reduced symplectic 
Lagrangian, it is exactly known, and is given by formulae (32) and (44), together with the pre- 
scription that all other terms r„ are zero, except r„>2 (^s well as T^ ), which can be freely chosen, 
for instance it can be chosen zero, or such that it makes the reduced Hamiltonian Hn zero by 
formula (69). 

These results agree with the standard results of the literature, provided the connection vector 
is defined as R^ := R, which corresponds to the traditional gauge-dependent framework. For 
instance, in the paper [6] Littlejohn made the choice r2 = 0, and accounting for this choice, 
our formulae agree with his ones. In the paper [2], the choice is H2 = 0, and again, accounting 
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for this choice through formula (58) for T^ , our results agree with the ones of [2]. Thus, the 
procedure of the previous section succeeds in obtaining the standard guiding-center reduction 
without introducing any gyro-gauge and using purely intrinsic coordinates. 



The explicit induction relation (67) shows that the reduction is possible to arbitrary orders, 
but it also gives an explicit formula to practically compute the transformation generator order by 
order. At any order in the Larmor radius, all that remains to do is to develop the Lie derivatives 
involved in the term R^. Only the number of terms generated by the Leibniz rule makes the 
process difficult to compute by hand at higher order, but the explicit induction involves few 
basic operations (just exterior derivatives and matrix products) and can easily be implemented to 
higher orders on a computer. Actually, as the series is a polynomial, the derivation does not rely 
on formal calculus but just on symbolic calculus, which is still easier to implement. 

Once the generators G„ are obtained, the guiding-center coordinate transformation is given by 

z — >z:= ...e~*^2g-Gi^^ 

The reduced Lagrangian 1-form is 

r = avg(r) - /ia-dc + \^'^ + Y^ r^] b-dq - ^ Hndt , (70) 

where the Hamiltonian terms iJ„ are provided by (67), whereas the parallel Lagrangian terms F^ 
and r„ are chosen freely at each order n, and can be chosen zero. 

The reduced dynamics is obtained the usual way, by computing the Lagrange matrix uJg := dTg, 
then inverting it to get the Poisson matrix J := uJJ^, and last computing Hamilton's equations 
z* := T-'djH. Alternatively, the reduced equations of motion can be obtained by Lie transforming 
directly the velocity vector field 

z — 7- z := ...e ^e ^z . 

Here, performing these computations is useless, since the reduced Lagrangian (70) completely 
agrees with previous results in the literature in the gauge-dependent case, and so will the reduced 
motion. The main difference is that the gauge vector R = Vei-e2 is replaced by the general 
connection R^ = Vc-a. 

As a result, the reduced Poisson bracket is exactly given by its expression at second order 
(which has the traditional structure of the guiding-center Poisson bracket), but with a reduced 



magnetic field (see [5, 6]) given by the whole series for F 



B* := V X 



A + bjv/VBcA + Fj + ^F^IJ+V. (71) 



n>2 
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The vector V is generated by the term —/la-dc in the Lagrangian F, which imphes for the Lagrange 
2-form the presence of the term 

—fi da- A dc = —fi da-(aa + bb + cc)- A dc 
= —jJL da-b A b-dc 
= —fi db-a A c-db 
= —fi dq-Vb-a A c-b'dq 
= —n dq-Vb-b x b'dq 
= :-dq-Vxdq, (72) 

where, the first equahty comes by inserting the identity (aa + bb + cc) beside the wedge symbol. 
The second and third equahties come because (a,b,c) is orthonormal. The fact that the vector b 
depends only on q implies the fourth equality, which can be rewritten to get the final answer. 

In the reduced Lagrange 2-form, the term (72) is the counterpart in the gauge-independent 
approach of the term dq • (/iV x R) x dq involving the gauge vector in the gauge-dependent 
approach. Actually, the curl of the gauge vector V x R is gauge- independent, and it is explicitly 
given by 

—fi (i((iq-Vei-e2) = —fi dq-Ve2- A ei'dq 

= -;udq-Vb-b X b'dq, (73) 

where again, the second equality comes by inserting the identity (bb + eiei + 6262)- beside the 
wedge symbol and using the fact that (b, 81,62) is orthonormal. 

So, in the reduced magnetic field B*, the term with R^ exactly agree with the corresponding 
term with R of previous results. This is a good illustration of how the approach using the coordinate 
c completely agrees with the gauge- independent part of the usual results, whereas it proceeds in a 
different way and never introduces the gauge ei. Indeed, in the usual approach, Vei-e2 is assumed 
to depend only on the position, which implies the formula above. On the contrary, in the gauge- 
independent approach, the velocity is present in the first lines of (72), but it comes out from the 
computation that the result is naturally just a purely spatial term in the Lagrange matrix. 

By the way, the argument above shows that V is indeed the curl of — ^R^ = — /iVc-a, but it is 
not surprising since the spatial part of a term df A dg is given by the cross product with the curl 
of -fVg: 

dq-{df A dg)'^''i-dq = dq-Vf A g'dq 

= dq-(V(/<7'))Adq 

= -(iq-(Vx(/<7'))xdq. 
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Thus, formula (71) can be rewritten 



A + b{y2^(/. + r'l^ + Y.^n- mR-s} 



n^2 



B* := V X 
which is manifestly diver genceless. 



Here, for clarity and following previous works, we considered no electric field, in order to focus 
on the reduction mechanism. The extension for a non-zero electric field is straightforward, because 
the Lagrangian is changed only by the addition of —e^dt, with <I>(q) the electric potential [2, 3, 6]. 
It affects the spatial part of dzH, which plays no pivotal role in the derivation: it always appears 
in the right-hand side of the equations, i.e. in terms that are already known. As a consequence, 
the presence of this term does not change the procedure at all. In the results, each term of order 
n becomes a polynomial in <I> (or rather V<I>), which introduces a second parameter ce in the 
theory, which corresponds to ^^ or to "^nj ■ A more detailed study shows that the momentum 
in denominators is only the perpendicular momentum ||p_l||, and that at the lowest orders the 
perpendicular electric field can be one order higher than the parallel electric field [3, 5, 6]. The 
series remains perturbative provided the associated parameter is small ee « 1, as usual in 
guiding-center reductions. 

B. Polynomiality in the momentum coordinates 

In the results above, the usual expansions are recovered, but the choice of the coordinates c and 
(p makes all quantities polynomial in the used coordinates and monomial in y^ and y^- This is 
useful to simplify the derivation, which can be considered as a symbolic-calculus algorithm based on 
just two operations acting on words (polynomials) composed from a very restricted alphabet. Such 
structures can also be useful when going beyond the formal expansions and considering them as 
asymptotic series. For instance, the polynomiality in the cotangent of the pitch-angle is important 
to control the loss of accuracy of the expansion in the domains where the direction of the particle 
momentum is close to the direction of the magnetic field. 

Previous results were obviously polynomial in the variable c, but they did not use it as a 
coordinate, since they replaced it by the variable 9. In addition, the expansion in the usual small 



parameter e := y^2fi/BV := r^V was also present, but it did not correspond to an expansion in 
fi nor in B, but only in V, which is not a scalar quantity; it is why keeping the quantity e~^ was 
useful to indicate the expansion order. Here, the order is directly indicated by the power in y^ or 
yB, since each order is a monomial in those quantities. 

The monomiality in y^ is especially interesting, because the orders in the operator V and 
in the function B have only a dimensional meaning: the term of order e" will involve terms like 
(^~) ' ^^^ ^^^° terms like ^— g — ^b~^ °^ V^b. In a similar way, the order in B is given by 
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the order of the prefactor r£ = ^^iijB , but the variable B appears in other places when it is 
acted upon by gradients; then, it is compensated by a i? in the denominator, which means that 
gradients act only on the variable InB. 

The use of the variable (j) instead of the usual wn is crucial for the results at each order to be 
both polynomial in the coordinates and monomial in y^ and v5- It is a generalization of [12], 
which considered only the minimal guiding-center reduction. It seems it had not been noticed in 
previous works on the full guiding-center reduction [2, 5, 6, 19]. 

When using the standard variable vu instead of (j), the monomiality in y^ and yS is not 
verified: for instance, in Gf , the term of order 0^ writes — ^— ^ cb'b, which is not a polynomial 
in y^ and which is not of the same order in ^jjl nor in v^ as the term of order <p^, since this last 
writes -g^— ^^^-^, as is confirmed in [2, 6], for instance. 

On the contrary, when using the variable (j), the polynomiality is verified, because of the struc- 
ture of the Lagrangian, of the action of derivatives, especially (6), and of the coefficients to be 
inverted for the matrix inversions involved in the derivation. Notice that the induction procedure 
also guarantees that all formulae will be polynomial in the variable b. 

As for the monomiality, it is easily understood from a dimensional analysis: only three inde- 
pendent dimensional quantities are involved in the results, e.g. V, B and p. When using the 
momentum coordinates {(f), p,c) or ((f),n,c), two of them are dimensionless, and only one of them 
can generate the dimension oi p, namely p or y^. For an expansion in r^V = aZ-^V, the variable 
H can be involved only in the pre-factor of each order, otherwise, it could not be compensated to 
generate a dimensionless quantity. 

On the other hand, when using the momentum coordinates {v\\,p,c) or {v]i,iJ,,c), there is a 
redundancy in dimension between vn and p (or ^/JlB), which means that ratios of them are expected 
in order to get dimensionless quantities. In an expansion in e, if the term of order (r/^V)" is a 
polynomial in t;j|, it has to be actually a polynomial in ^ or J-r, . As a consequence, formulae 
will be sums of terms (^)" ( I > which is not a monomial in p (or equivalently in y^ and \'B); 
by the way, it is not a polynomial either, because of the terms where j > n. 

L.., .he po..„o„.a>i.. i„ , .nea„. .h. e«h .e™ wH.« (^,» (ig.)' . 
cos^ ipsin^"^ if, which agrees with the idea that the entities cos 99 and sin 99 come from 



m" 



expansions of the momentum p, or rather the corresponding dimensionless vector ^. 



C. A two- fold maximal reduction 

In addition to the averaging reduction and the inclusion of the magnetic moment among the 
reduced coordinates, the goal was to obtain a reduced dynamics as strongly reduced as possible. 
So, a complete achievement is obtained when all the components of r„ are zero, at least for higher 
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orders. For the procedure, it means using the seven unknowns to solve the seven corresponding 
equations, or rather twice these numbers, if one considers the average and the fluctuating parts as 
different variables. It did work for all the requirements but one, which could not be satisfied and 
had to be dropped. 

The obstruction for a complete reduction r„^3 = comes from the requirement (19): it 
imposes to obtain r„ by fixing the freedom avg(G), which was the only freedom available for 
the equation Tt = and is no more available for it. It is why one of the requirements has to 
be dropped. Then, it remains more unknowns than requirements. So, the transformation is 
not unique. Especially, while it naturally appeared in H, the non-zero component of F can be 
transferred to F . Thus two maximal reductions can be considered. 

The first alternative (called the Haniiltonian representation in [18]) sets (Fs)„^2 = 0. Then, 
the reduced Poisson bracket is completely known before computing the transformation to higher 
orders; it is given by the lowest three orders of the symplectic Lagrangian (Fs)„^i. The reduced 
Hamiltonian is not exactly known; it is given by a whole series in e and must be computed order 
by order. The reduced dynamics is a Hamiltonian perturbation of the guiding-center equations of 
motion at order 2. 

The second alternative (called the symplectic representation in [18]) is to set Hn^i = 0. Then, 
the Hamiltonian is completely known, and the structure of the Poisson bracket is also known. 
The only unknown information on the reduced dynamics is concentrated in the component of the 
reduced Lagrangian parallel to the magnetic field F„, which is given by a whole series in e. The 
Poisson bracket includes a kind of reduced magnetic field B*, induced by the higher-order terms 
of the Lagrangian. 

The choice of symplectic or Hamiltonian representation can be made at each order in the 
derivation, but it seems more convenient to be consistent and to make the same choice for all 
orders, as suggested in [18]. 

These two maximal reductions give a unified view of various choices that can be found in the 
literature, and they anticipate what will happen at higher orders. Indeed, even in the standard 
non-canonical Hamiltonian approach of the guiding-center reduction introduced by Littlejohn, 
several transformations can be found, often related by differences of choice related to this two-fold 
maximal reduction. For instance, Littlejohn's initial guiding-center reduction [5] corresponded to 
the second possibility above at order n = 1, but at higher order, it is unclear whether the procedure 
provided a maximal reduction, or if some terms could remain both in the Hamiltonian and in the 
Poisson bracket. The seminal reduction by Lie transforming the Lagrangian [6] corresponded to a 
maximal reduction with the first choice ("Hamiltonian representation"). 
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Later papers by Lie transforming the Lagrangian turned to the other choice, e.g. [2] used a 
maximal reduction (at order n = 1) but with the second possibihty. Recently, while improvements 
in the second order were addressed [18, 19], interest was renewed in the first possibility. The 
paper [19] actually corresponds to a mixed choice, where the second possibility is used at order 
n = 1 and the first one is used at order n = 2. The work [18] introduced the designation of 
Hamiltonian and symplectic representation to differentiate between the two choices, and studied 
an "equivalence relation" between the two choices (when the same choice is made for all orders), 
which is a way to go from one representation to the other by a redefinition of the reduced parallel 
momentum pu (or equivalently of (p). 

The algorithm at higher order, with the condition (69) for the symplectic representation, had not 
been studied in detail. In the previous section, the Hamiltonian representation appeared as indeed 
guaranteed at arbitrary order, and as naturally provided by the procedure as a maximal guiding- 
center reduction, whereas the symplectic representation appeared as submitted to a condition at 
each order in e: in formula (69), the term inside the bracket must have no overall contribution of 
order in (p. This could explain why first papers addressing both the first and the second order 
in e systematically used the Hamiltonian representation; the symplectic representation was used 
only later, when the condition was observed as verified. 

The equivalent relation introduced in [18] relies on a relationship between these representations, 
which allows to go from one to the other. Hence it might seem that it guarantees the existence of 
the symplectic representation, but it is not the case. 

The underlying idea (see equation (17) in [18]) is the following. Start from the reduced La- 
grangian written in symplectic representation (we use here the variable pn instead of (p, in order 
to agree with the notation used in [18]): 

Then, redefine the reduced parallel momentum so as to absorb all the series F in it: 

^ii-=^ii+E(n|i)«- (^4) 

n=l 

With this coordinate, the symplectic part of the Lagrangian is fully reduced: 

~ P\\ ' 
To obtain the reduced Hamiltonian with this coordinate, one just inverts the series (74): 

^11 ■= P\\ - E(nii)n = p'li + E(n||)« ' C^^) 
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with (nn)„ some coefficients easily obtained by inserting iteratively the first equahty in the occur- 
rences of pji in ^„=i(nj|)„, as is standard to invert a near-identity series. 
Then the reduced Hamiltonian in the new coordinate writes 



H = i,B + m+Y.(^'\\ 



It is a full series in e, which corresponds to the Hamiltonian representation. This is a constructive 
procedure showing that when the symplectic representation exists, then the Hamiltonian repre- 
sentation exists and is easily obtained. 

Now, what is actually needed is to go in the reverse direction, since the derivation of the 
guiding-center reduction shows that the Hamiltonian representation is natural and guaranteed to 
exist, whereas the Hamiltonian is suspected of having existence conditions. 

It turns out that the procedure in the reverse direction can break down. Start from the reduced 
Lagrangian written in Hamiltonian representation: 



H = fiB + ^ + J2Hn, 



n>l 



^b 



r =p,|. (76) 

Then, redefine the reduced parallel momentum so as to absorb all the higher-order terms ^„>i H^ 
in the term with pu : 

pf:=pl + 2Y,Hn. (77) 

With this coordinate, the Hamiltonian part of the Lagrangian is fully reduced: 

To obtain the reduced symplectic Lagrangian with this coordinate, one just inverts the near-identity 
transformation (77) by writing it as: 



n>l 



^r=±.M+2E^« = ^n/i + ^^^f^' (^8) 



and then by expanding the term \/l + e. This assumes that the ratio "i^' — - is small. The point 
is that this condition is not guaranteed a priori, even if the series is near-identity. 

Indeed, the derivations here are formal. "Near-identity" has only a dimensional meaning. It 
means that the ratio between the first and zeroth-order term is of order e = r^V, but only in 
dimension, its value might not to be small if it is multiplied by a large dimensionless factor such 
as 1/ cos 93, as in (78). Thus, a division by pu can cause a singularity. 

When going from the symplectic to the Hamiltonian representation, no such a division was 
needed, since the series inversion (75) just consisted in composing series. On the contrary, when 
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starting from the Hamiltonian representation, the series inversion (78) involves a division by p?,. 
This causes a singularity if Hn contains a term of order or 1 in pi i. 

It is interesting to see that difficulties arise here at pn =0 (or equally at (/) = 0), which is precisely 
where they appeared in the guiding-center reduction in the previous section. This suggests that 
singularities in </> = are indeed a difficulty for the Hamiltonian representation. Accordingly, at 
each order, it can be used only when the absence of singularity in (69) is verified. 

D. Gyro-gauge independence 

The intrinsic formulation of the guiding-center reduction was motivated by questions about the 
traditional gyro-angle variable 9. The derivation with the coordinate c shows that it does succeed 
in shedding light on those questions. 

First, the traditional coordinate was a detour. In all guiding-center works, the variable 6 never 
appears in itself (except in its own definition and subsequent relations); for instance, it does not 
appear explicitly in one of the components of G or T, which all depend on 6 only through the 
corresponding physical quantity c (or a := b x c); even the gyro-angle component of the generator 
G verifies this statement. The detour is not given by the physics, since it imposes to fix arbitrarily 
a gauge ei(q), which is not related to the physics of the problem. The role of the intrinsic approach 
was to avoid this detour, and it achieves its goal since it obtains the full guiding-center results 
without introducing any gauge and by working purely with c. 

From a mathematical point of view also, the use of the variable 9 was not completely 
satisfactory, because the gyro-angle corresponds to a circle bundle [7, 11, 16]. The traditional 
coordinate 9 makes this structure somehow disappear, because the manifold trivially becomes 
W^ X S^. It is why the variable 9 does not have a global existence in a general magnetic geometry 
[11]. On the contrary, when using the physical variable c, the circle bundle naturally arises: as 
c is defined on a space-dependent circle, spatial displacements imply a variation of c, so that a 
covariant derivative is involved, which encodes the circle-bundle geometry for the gyro-angle [11] 
and does not imply some restricted class of circle bundle. 

In some way, the relevance of this coordinate is obvious, since it just results from keeping the 
initial coordinate, in which all the circle-bundle picture was included. From this point of view, 
performing the derivation with this variable is a way to see how it globally agrees with the physics 
and the mathematics of the problem, and to make intrinsic definitions arise naturally for all the 
quantities involved in the process. 

Indeed, the previous section shows that the reduction follows the same procedure with the 
vectorial constrained coordinate c as with the scalar coordinate 9, but that there are slight changes 
in the quantities used. The gauge vector R came as naturally replaced by the connection R^ for 
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the covariant derivative. The generator of Larnior gyrations dg came with an intrinsic definition 
— a-3c. The basic l-form for the gyro-angle dO appeared as replaced by a non-closed l-form 56, 
which agrees with the fact that 9 considers the circle bundle trivial, whereas it should not. This 
implied to use more intrinsic definitions for the operations used, such as (48) and (49) for exterior 
derivatives. Also, this implied to be careful on how the basis of 1-forms and of vector fields are 
chosen, but the natural ones were found to agree with each other. 

Thus, the formalism with c is slightly more involved, but it perfectly fits both with the physics 
and the mathematics of the problem, which correspond to a non-trivial circle bundle. 

From a formal point of view, the results with the physical variable c mainly correspond to 
replacing the gauge vector R by the connection term R^. Thus, they include the standard 
gauge-dependent results as a special case, but they are more general: in the usual approach R(q) 
depends only on the position and can not be chosen freely (e.g. R = is not possible [5, 7, 11]); 
here, Rg(q, p) can be any function of the position and momentum. Especially, the physical 
definition of c corresponds to the function R^ := — 0Vb-c, which depends also on the momentum 
and preserves the polynomiality in (p. 

Other gauge-dependent quantities are interesting to consider. In previous works, the coordinate 
transformation 9 — > 9 was gauge dependent (see for instance equation (30c) in [6], or in [3] the 
solution for the generator G^ below equation (5.45)), as well as the definition of the coordinate 
9, and also the gradient ^qig. It is why the gauge vector R was involved in some of the resulting 
formulae, e.g. the Poisson bracket, in such a way as to make all the physical or geometrical 
(intrinsic) quantities gauge- independent. For instance, in the Poisson bracket, gradients appear 
only in the combination V* := V -|- Tidg [3]. It would be interesting to interpret it as the gradient 
corresponding to a special gauge, because it would remove the appearance of the gauge vector in 
all the derivation, and would simplify computations. The issue is that it is not possible because 
it would correspond to fix the gauge in such a way that Vei-e2 = 0, which is not possible even 
locally [5, 7]. 

In the gauge-independent approach, all the coordinates, including the gyro-angle c, are gauge- 
independent, as well as the transformation c — > c: at first order, it is not transformed by aG^, 
because the covariant derivative must be taken into account, which means that it is given by 
aG^ — G^-Vc. This last quantity is indeed independent of the connection vector R^ := Vc-a, 
as can be verified in (54). In the same way, at higher order, all the transformed coordinates 
z = ...e~^^e~^^z will be independent of R^, where z := (q, (^, ;U,c) are physical coordinates. 

Gradients are also involved in combinations involving R^. This is no surprise, since the connec- 
tion on the fibre bundle involves some arbitrariness, but the combinations can always be written 
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V^, := V + Tlgdg, which is connection-independent. In addition, it can be interpreted as the co- 
variant derivative associated with the trivial connection R^ = 0. Thus, when working with the 
coordinate c, this choice can be used to siniphfy computations and to make them connection- 
independent. 

E. Maximal vs. minimal reduction 

The derivation procedure confirms the respective interests of Lie transforming the velocity 
vector field and the Lagrangian 1-form. 

As with concerns the minimal requirements for the guiding-center transformation, working on 
the equation of motion is much more efficient, since it systematically obtains the fluctuating part 
of the reduced motion just by inverting the operator dg. 

The procedure with the Lagrangian is much more involved, as can be seen in previous sections, 
especially because the order mixing makes the scheme more elaborated and because the algorithmic 
stage begins only at higher order: the induction matrix mixes up the orders, changes at each order 
for n ^ 3, and involves differential operators in some coefficients. It is why in this paper, as in 
previous works, only a part of G2 is explicitly computed, whereas the work [12] directly obtained 
the full second-order generator G2. 

In addition, the minimal guiding-center reduction can hardly be obtained by working on the 
Lagrangian, because going from the Lagrangian to the motion mixes the components up. To 
guarantee an averaged slow reduced motion for the four coordinates (q, 0) , one would need to 
average all of the seven components of the Lagrangian, which is not a minimal transformation. 

As with concerns the additional requirements for the slow dynamics, working on the equation 
of motion is not efficient, because the equations to be solved are secular differential equations 
that are not simple to deal with [12]. Working on the Lagrangian is more efficient, because it 
essentially consists in algebraic equations, which deals the same way with gyro-averages as with 
gyro- fluctuations. This makes it easy to identify good choices for the averaged transformation 
generator avg(G„) in order to obtain a reduced Lagrangian as strongly reduced as possible. Thus, 
it provides a maximal guiding-center reduction almost as simply as the minimal one. 

Also, working on the Lagrangian 1-form makes it easy to impose requirements on the reduced 
Hamiltonian structure, for instance to obtain a quarter-canonical structure for the coordinates 
(71,^), which both provides a constant of motion /I and a Hamiltonian sub-dynamics for the 4- 
dimensional reduced motion (q, (p) . 
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V. CONCLUSION 

The full guiding-center reduction can be performed to arbitrary order in the Larmor-radius ex- 
pansion by Lie transforming the Lagrangian l-form while keeping physical gyro-gauge-independent 
variables as coordinates, following the same procedure as when working with the standard gauge- 
dependent gyro-angle. 

For higher orders, the procedure was shown to be completely algorithmic. The pivotal role 
is played by the inverse of the lowest-order Lagrange matrix uJ-i + ooq + ^i, together with a 
differential equation for the function Sn- An extended matrix was defined and used to explicitly 
solve the induction equation to arbitrary order in the Larmor radius. 

The results exactly agree with previous works, but they were obtained without introducing 
any gyro-gauge, and working purely with the physical coordinate c as the gyro-angle coordinate. 
In addition, the choice of the cotangent of the pitch-angle as a coordinate for the parallel velocity 
made the results purely polynomial in the coordinates and monomial in y^^ and \'n. 

Compared to the method by Lie transforming the equations of motion, the process is much more 
elaborated, especially because of the order mixing, but it easily obtains a much stronger result. 
It does not rely on differential equations for the reduced motion, but on algebraic equations for 
the reduced Lagrangian. A quarter-canonical reduced Hamiltonian structure provides a constant 
of motion JI and a Hamiltonian sub-dynamics for the 4-dimensional slow reduced motion (q, 0) . 
In addition, the procedure makes the reduced dynamics trivial not only in the gyro-fluctuating 
components of the Lagrangian, but also in six of the averaged components out of seven. 

As a result, all but one of the components of the reduced Lagrangian l-form are put to zero for all 
orders higher than two. Only one of them can not be made exact, and is given by a whole series. The 
two canonical choices are recovered: either to enclose the series into the Hamiltonian (Hamiltonian 
representation), then the reduced Poisson bracket is exact, either to enclose the series into the 
spatial component of the Lagrangian parallel to the magnetic field (symplectic representation), 
then the Hamiltonian is exact and the uncertainty of the reduced motion is traduced by a kind of 
reduced magnetic field B^,. 

The Hamiltonian representation appeared as naturally induced by the reduction process, 
whereas the symplectic representation is subjected to a condition at each order, to avoid a singu- 
larity in = 0, i.e. at the bounce points of particle trajectories. 

These representations make the reduction maximal because for a general magnetic field, the 
procedure can not get a stronger reduction for which even the last component of the Lagrangian 
would be zero. The obstruction originates from the special role of the magnetic moment; in 
the Hamiltonian representation, this can be viewed because the magnetic-moment component of 
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the transformation generator can remove only the fluctuating part of the reduced Hamiltonian 
function, since the averaged part is imposed by the requirement of adiabatic invariance. 

The use of gauge-independent coordinates had little effect on the reduction procedure. All the 
ingredients of the standard reduction with gauge-dependent coordinates were found to be present, 
but they naturally arose with an intrinsic definition or they were replaced by a different intrinsic 
object playing a similar role. 

It was observed to fit in with both the physics and the mathematics of the system, by restoring 
the general circle-bundle framework, which practically disappeared with the coordinate 0, and by 
making the coordinates directly induced by the physical state of the system. 

For instance, the gauge removal introduced a vectorial quantity c for the gyro-angle coordinate. 
This caused the coordinate system to be constrained and implied a connection for the covariant 
derivative on a space-dependent circle, which is directly linked to the circle-bundle structure un- 
derlying in the gyro-angle coordinate and which replaced the gauge vector of the gauge-dependent 
approach. 

The closed 1-form d6 was replaced by a non-closed 1-form 69, which is related to the non- 
triviality of the circle bundle for a general magnetic geometry. 

In previous works relying on the coordinate 6, the gauge- independence of the physical 
results implied that gradients were systematically involved in special expressions, which 
were not related to derivative operators because no gauge fixing were suited to them. These 
expressions were found to be related to covariant derivatives corresponding to suitable connections. 

Unlike the gauge fixing for the coordinate 9, the connection fixing for the coordinate c depends 
not only on the position, but on the momentum as well. This is all the more convenient as 
the physical definition of c and its associated connection depend on both the position and the 
momentum. In addition, this removes one of the assumptions causing the presence of anholonomy 
in the gyro-angle motion. 

So, the intrinsic gyro-angle coordinate c is a way to tackle some of the questions involved in 
the guiding-center anholonomy and gauge- (or connection-) arbitrariness. These questions are 
outside the scope of this paper and will be reported elsewhere [17]. 

In this paper, we focused on the formal derivation of perturbation series, as is usual in guiding- 
center works, and as is the standard first step in perturbation theory [14]. A next step will be to 
relate these formal expansions with asymptotic series, in a similar way as what was done in [23] for 
Kruskal's work, and what is beginning being done about Littlejohn's works [24]. In this attempt, 
the structures of the expansion series, such as the polynomiality induced by the cotangent of the 
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pitch-angle, are expected to play a role. 
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Appendix A: Mechanism of the reduction 

In this appendix, we introduce the mechanism at work when Lie transforming the Lagrangian 
1-form. Indeed, the basic ideas of the derivation are very elementary, but they are hidden by 
the details of the procedure, which are rather involved because of some order mixing and other 
subtleties between algebraic and differential integrability conditions. In addition, practical com- 
putations in the case of the guiding-center are somehow intricate. All the same, the method is 
very efficient and has quite a wider domain of application than just the guiding-center reduction. 
So, it seems useful to give a general overview of the method for people not familiar with it. 

1. Fundamental ingredients 

A) The goal is to solve equations (12) for the unknowns G„ and Sn, with the requirements (16), 
(17) and (19) identified above. Ideally, the maximal reduction sets r„ = for all higher orders n, 
as a result of (17). 

The solution is built order by order in the Larmor radius. Each order implies to solve the 
equation 

r„ = Gn+l-OJ-l + Rn + dSn , 

where R„ is a shorthand for all other terms, that do not contain the highest-order generator G„+i. 
The very basic idea of the reduction is that Gn+i is involved only through a matrix product. 
So, at any order, the solution is just given by a matrix inversion 



^n+l — ("^-ij 



-'■^n-Rn-dSn], (Al) 



provided u;_i is invertible; then r„ can be chosen zero, and Sn is not useful and can also be set 
to zero. This idea of matrix inverse is the key ingredient of the underlying mechanism, even if 
the corresponding basic picture is not true at the lowest orders, and at higher orders, it is slightly 
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complicated by some order mixing and integrability conditions, especially for Sn- 

B) As a matter of fact, the matrix oj^i is usually not invertible, since it corresponds to the 
fast part of the dynamics, here the Larmor gyration, which does not concern all the phase-space 
coordinates. 

At zeroth order, under the requirement r„ = 0, equation (12) writes 

Gi-uj^i = -To- dSo. 

It has a solution only if the right-hand side — Fq — dSo is in the range of the matrix a;„i (solvability 
condition); this is a necessary condition for the corresponding reduction to exist. Usually, it is not 
verified for so strong a requirement as Fq = 0, and the reduced Lagrangian Tq has to be used as a 
softening parameter. Then equation (12) writes 

GriO-i=To-ro-dSo. (A2) 

One has to check that, with the freedoms 5*0 and Fq, the solvability condition can be satisfied 
at least for the minimal requirement (16) and if possible for the intermediate requirement (19); 
then, the reduction is possible, and the maximal requirement (17) can be considered by trying to 
remain as close as possible to the condition Tq = 0. 

At that point, the solution exists, but it is not unique; it is defined to within an element of 
the kernel of oj^i. The choice of this element may be free at this stage of the reduction, but care 
must be taken that it may be constrained by the solvability conditions at the following order. 

At the next order n = 1, equation (12) writes 

G2-W-1 + ^-(^0 + wo) = fi - dSi . (A3) 

Now, the pivotal matrix Mi to be inverted is the set of a;_i and ii^ii±^^ acting on the set of 
unknown components of (G2,Gi). Its rank is greater than (or equal to) the rank of oj^i. Care 
must be taken that some of the coordinates of Gi are already determined. This introduces some 
order mixing, where some components of G„ are determined at order r„_i, others are computed 
at order r„ at the same time as some of the components of G„+i. 

Notice that Mi can not be invertible on the unknown components of G2, because of the non- 
trivial kernel of u;_i. In the same way as at zeroth order, this kernel has to be excluded when 
studying the invertibility of Mi, because it will be involved only at the following order. 

Then, if the pivotal matrix Mi is invertible for the ideal requirement Fi = 0, then the solution 
exists and is unique. Otherwise, there is again both a solvability condition and a non-uniqueness 
of the solution. More precisely, the solvability condition means that there is a solution only if 
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the right-hand side is in the range of the matrix to be inverted, and that in order to fulfil this 
condition, the reduced Lagrangian Fi may have be chosen non-zero, but having as many null 
components as possible. The non-uniqueness means that the solution is determined only to within 
an element of the kernel of the pivotal matrix Mi. The choice of this element may be free, but 
care must be taken that it may be constrained by the solvability condition at the following order. 



At the next orders, the same process goes on. The pivotal matrix M„ evolves at each order 
and its rank increases to determine more and more of the unknowns. At high orders, it becomes 
of constant rank, and actually it becomes the same at each order. The critical value of n at which 
this occurs will be denoted by ric, and the corresponding pivotal matrix will be denoted by Mqo- 
So, for n ^ ric, the pivotal matrix verifies M„ = Moo, whereas for n = ric — 1, it verifies M„ 7^ Mqo. 

This can be explained as follows: Equation (15) generically (i.e. at high orders) writes 



{Gn+vd) + {Gn-d){Gvd) + ...+ 



(Gl-d) 



n + l 



n+1! 



+ 



[Gn-d) + {Gn-vd){Gvd) + ... + ^^^ 



r-1 

Tq + dSn 



(A4) 



In this analysis, low orders are excluded because there would be some additional coefficients coming 
from the exponential series: for instance for n = 1, the term (G„-d)(Gi-(i) has a factor 1/2 and is 
confounded with the last term - — 5]-^. 

Denoting Gn-d by G„, and grouping together the highest-order Lie derivatives, which contain the 
unknowns (which are some of the components of (G„+i, G„, ...)), the previous equation becomes 

Tn = G„+ir_i 

+ G„(Gir_i + ro) 

(A5) 



Gn- 


-1 


(G2 + f)r_i + Giro 


Gi 


n 




+ 



Using (12) for the lowest-orders reduced Lagrangian T/., that are already known, the previous 
formula can be rewritten 



Tn — G„+i(r_i — dS-i) + G„(ro — dSo) + G„_i(ri 



dSi) 



+ ... + Gi 



(Gi) 



Trr-i + 



(Gi)- 



+ dS„ 



Gn+l-(^~l + G„-Wo + ... + dSn + R?; 



(A6) 



where R„ indicates all other terms, that are already known, since they do not involve 

(G„-|-i, Gn, ■■■)■ 

Equation (A6) shows that the pivotal matrix M„ is given by the set of matrices uJ-i, uJq, etc., 
acting on the set of unknown components of (Gn+i, G„, ...). The matrix M„ is the same at any 
(high) order. It is exactly given by the reduced Lagrange matrix at lowest orders. 
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As a consequence, the pivotal matrix Mqo for all high orders is identified as soon as the set of 
{uJ^i,uJq, ...) is observed to be invertible on the set of unknown components of (G„+i, G„, ■■■), i.e. 
as soon as c<7_i +a;o + .-. becomes invertible. We will call rib the order such that aJ_i +^o + ■■■+'^nt 
is invertible, whereas uJ^i +uJq + ... + uJn^^i is not. 

At that point, the basic picture of A) has become a simple picture B), which includes two 
stages. At low orders it consists in dealing with non invertible pivotal matrices changing at each 
order, and in choosing r„ such that it both fulfils the requirement and leads to an interesting 
invertible matrix lJ„i + lJq + ... + ^rif Then at high orders, the induction becomes just a matrix 
inversion M^^, as in the initial basic picture A). 

C) The simple picture B) has to be refined. Between these two stages, an intermediate stage 
takes place, since usually ric > nb + 1. For n £ {ub + 1, n^ + 2, ...,nc — 1}, the higher-orders pivotal 
matrix is already known but not yet efficient. 

The reason is that formulae (A4)-(A6) hold only for orders that are high enough, because of 
the coefficients generated by expanding the exponentials. If the factors involving (G„+i, G„, ...) 
have some coefficients non unity, then formula (A6) does not hold, which spoils the conclusion. 

But this concerns only low orders. Formula (A5) shows that the coefficients will be unity as 
soon as n > 2nb + 1, which means that ric ^ 2nb + 2. 

As an example, consider cJ„ for n = 0. When computing the next order r„+i = Fi, then Fq is 
already known, but it is not yet efficient: in formula (A6) 

Fi = G2F_i + Gi (iGiF_i + Fo) + dS^ 

/ G2-a;_i + Gi-ZJo + dSi , 

the operator to be inverted is not just the set of (cZJ_i,ZJo) because of the factor 1/2 in the first 
line, which comes because the generator Gi outside the parenthesis has the same order as the 
generator Gi inside the parenthesis. For all higher orders, this will not happen, as is illustrated 
by the next order 

Fs = GgF.i + G2(GiF_i + Fo) + o.t. 

= G2-UJ-1 + Gi-cJo + o.t. , 

where o.t. is used for "other terms", in order to avoid writing uninteresting terms. 

D) The order mixing can also slightly complicate the picture of C), by spoiling the linear 
algebraic framework, mainly at order n = 1. Indeed, the first equation to be solved for G2 is 
(A3). However, if some of the components of Gi are still not determined at that point (this is 
fairly general as uj-i is usually not invertible), then they can be involved in a differential equation. 



60 

Indeed, Tq can be undetermined at that point, and equation (A3) must be let under its initial form 
(12) 

Ti = (g2 + ^Gi) -w^i + Gr^o + dSi , 

which is now a differential equation for Gi, and may even be non- linear in the unknown 
components of Gi. This can make the scheme much more complicated: even solvability conditions 
may be difficult to identify. 

E) Finally, one last point has to be taken into account as well and still makes the scheme 
more elaborated than the picture D) above. The pivotal matrix M„ determines the unknown 
components of the generator (G„+i, G„, ...), but this can generate non- zero time-component G^ 
for the generator. 

For a time-independent transformation, the requirement G^ = 0, reduces the dimension of the 
effective generator G„. Then the pivotal matrix can be inverted only if some integrability condi- 
tions are fulfilled. Another way of saying it is that r„ has seven components (seven requirements) 
whereas G„ has only six freedoms. The additional freedom comes from the gauge function S'„. 

Actually, the presence of this integrability condition for the pivotal matrix Mqo is completely 
general and comes because Mqo is antisymmetric. It is not invertible on the 7-dimensional space 
(q, p, t), and can be invertible only on a sub-space, e.g. on the 6-dimensional phase-space (q, p). 
For a symplectic Hamiltonian system, the high-orders pivotal matrix Mqo is indeed invertible when 
restricted to the phase-space, since the Lagrange 2-form uJs is invertible, and so is uJs- 

So, the gauge function Sn is not determined by the algebraic matrix inversion, but by the 
solvability condition for the matrix inversion. Furthermore, it appears in a differential equation. 
Existence of solution for this differential equation can involve other integrability conditions. For 
instance, in an equation such as 

deSn = fn , 

inverting deS implies the function /„ to have no gyro-average. 

As a result, both the algebraic and the differential integrability conditions must be played 
with so as to make r„ = 0. If it is not possible, one has to choose a non-zero reduced Lagrangian 
r„ 7^ 0. This means playing with the requirements also, and releasing them slightly, so as to make 
the integrability conditions fulfilled and at the same time to keep F as strongly reduced as possible. 

All these features do not spoil the algorithmic character of the reduction for high orders, because 
the differential scheme is very simple (the operators are just d^k) and in addition, at all n ^ ric, 
the algebraic scheme for (G^+i, G„, ...) is fixed, which makes it possible to conclude about the 
differential scheme for Sn so as to make the resulting reduction maximal. 
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At the end, the induction relations can be written in matrix form provided some coefficients 
of the matrix are differential operators. By such a redefinition of the matrix Mqo, the induction 
relation for high orders n ^ Uc just relies on a matrix inverse M^^. Then, the basic picture of 
formula (Al) becomes efficient: equations (12) are solved at arbitrary order through a formula 
completely analogous to (Al), even if the framework is much more elaborated. We want to stress 
this fact because the order mixing and the presence of integral operators may hide the triviality 
of the induction mechanism. 

2. Resulting procedure in three stages 

The previous subsection shows that the reduction is performed in three stages. The first stage 
corresponds to the first few orders n ^ rii,. The work consists in verifying that the freedoms can be 
used both to make the solvability conditions satisfied and to get an interesting invertible matrix 
Moo- At the end n = nb, the invertible high-order pivotal matrix Mqo becomes identified, and the 
first stage is ended. 

The second stage corresponds to a transition stage. The pivotal matrix for high order is 
identified, but it is still not efficient at that order. The goal is only to check that the solvability 
conditions can be satisfied at these intermediate orders. 

The third stage begins at order n = Uc, i.e. as soon as the matrix to be inverted becomes Mqo. 
From that order on, it is sure that the reduction can be performed to any order in the Larmor 
radius. As the matrix is now invertible, the solution exists and is unique to each order, and the 
process becomes fully algorithmic. 

In order to get a formula analogous to (Al), the pivotal matrix must be extended to include 
the gauge function S'„, and some coefficients of the inverted matrix M^^ are then integral 
operators. In addition, in order to deal with the order mixing, some intermediate quantities must 
be introduced to isolate the components of (Gn+i, G„, G„_i) that are already known from the 
ones that are not identified yet. 

For example, if the pivotal matrix Mqo involves only ZJ-i and Uq. Then, the equation (12) or 
(A6) writes 

Tn = G„+i-ZJ_i + Gn-Wo + dSn + Rn , (A7) 

where R„ indicates all terms of (12), that do not depend on the unknowns, which are the gauge 
function Sn and some components of (G„+i, G„). These last quantities can be grouped into one 
single vector 

En '■= {Gn+l,Gn,Sn) ■ (A8) 
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The pivotal matrix M„ is then extended to act on all g„ (including the gauge function) in (A7) 
and is defined by 

Mn-gn := Gn+i-cZ;_i + Gn-UJQ + dSn . 

As announced, some of its coefficients (the ones acting on the component Sn) are differential 
operators. With these conventions, equation (A7) writes 

Now, some of the components of (Gn+i, G„) are already identified at that order. Let us denote 
them by the index a, and the remaining components of g, which are not identified are denoted by 
the index oo: 

with (gn)a fully identified and all terms of (gn)oo fully unknown. The Lagrangian writes 

r„ = {Mn)aign)a + (M„)oo-(gn)oo + Rn , (A9) 

with obvious definitions for the linear operators (M„)a and (M„)oo- By assumption, the quantities 
R„ and (M„)a(g„)a are known; in addition, (M„)oo = M„ = Mqo is known and invertible. As a 
consequence, the induction relation writes 

(gn)oo = M^l- [r„ - (M„)a-(gn)a " Rn] • (AlO) 

It is explicit and makes the basic picture (Al) apply to all orders n ^ ric- Some coefficients of M^^ 
are integral operators, since in the inverse matrix Mqo some coefficients are differential operators. 
A few comments are in place. First, some components of Gn+i remain non-identified after the 
order n; they must be excluded from (gn)oo to get an invertible matrix, because they are elements 
of the kernel of a;_i and will be determined at the next order; this is well illustrated by (59) and 
(63)-(64). Second, the components (gn)a can be extracted from g„ and its term {Mn)aiEn)a can 
be grouped with R„ (see formula (61)), which plays the same role. Last, the reduced Lagrangian 
r„ is in principle taken to be zero, but it was kept free because integrability conditions for Sn 
can make it necessary to choose some of its components non-zero; then, it can be included in the 
vector g„, as is done in (59) and (63)- (64). 

The final algorithm to be iterated for the n-th-order term is trivial: in formula (AlO), replace 
the lowest orders terms by their expression, already known, then compute the Lie derivatives 
involved in the term R„, and last apply the matrix product with M^^. The mechanism involves 
just two kinds of operations, derivatives and a matrix product, which can be easily implemented 
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to arbitrary order using computer-assisted formal calculus. 

The basic idea shown in (Al) and (AlO) explains why Lie transforming the Lagrangian 1-form 
has the advantage of algebraic equations, which makes it easy to reduce also the averaged part of 
the reduced motion, and thus to get non-minimal guiding-center reductions. Indeed, computations 
for the non-minimal requirements are treated the same way as for the minimal ones, the only 
difference concerns the priority: if all requirements can not be satisfied, then the order of priority 
may impose the requirements to be preferred and the ones to be released. This is an essential 
advantage of Lie transforming the Lagrangian. 

But the overall process is much more involved than the method relying on a Lie transform of 
the equations of motion. This last has the essential advantage of relying on just a gyro-integral, 
which makes it much more efficient to work on the fluctuating part of the reduced dynamics and 
to perform the minimal guiding-center reduction, as is clear in [12]. 

In both cases, the reduction relies on explicit induction relations, but when working with 
the Lagrangian, the algorithmic stage (third stage introduced above) is efficient only for higher 
orders. For lowest orders, the reduction is not systematic at all, the choices are crucial to make 
the reduction work or not, but they must be guessed rather than derived. In addition, many 
solvability conditions appear in the process, and there is no a-priori guarantee that they can be 
satisfied. 

In the case of the guiding-center reduction, good choices appear rather naturally, solvability 
conditions come as easily satisfied, and the reduction can be considered as rather straightforward, 
but two specificities must be taken into account. 

Indeed, as expected, at each order, the fluctuating part of G„ is imposed by the minimal 
requirement (16), which means to put to zero the gyro-fluctuating part of the Lagrangian; and the 
averaged part is imposed by the other requirements (17) and (19), which mean to put to zero the 
averaged part of the Lagrangian as well (except that Ti =Ji). 

However, one of the components of (G„+i, G„, ...) that remains not identified is already present 
in equation (A7): avg(r^) remains as a parameter in the right-hand side of (AlO). 

Furthermore, the integrability conditions on Sn can not be fully satisfied, one of the optional 
requirements (17) must be dropped; so, the average component F (or alternatively F ) is not 
zero but used to make the integrability condition satisfied. Accordingly, one of the freedoms (the 
average gauge function avg(S'„)) remains undetermined for the maximal reduction. To determine 
it, a prescription must be added. For the simplest maximal reduction, it is put to zero. 

All these features will suggest to define and decompose the vector g„ in a different way as in 
(A8) and (A9), by including in this vector only the unknowns that are involved at that order (see 
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formula (59)), and by distinguishing between the unknowns that will be identified and the ones 
that will remain parameters (see formulae (64)- (63)). 
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